DESCRIPTION

METHODS AND COMPOSITIONS FOR FULL-LENGTH cDNA CLONING

Cross-Reference to a Related Application
This application is a continuation-in-part of co-pending U.S. application Serial No.
08/582^62, filed January 3, 1996.

Field of the Invention 

The present invention relates to improved technology for selectively synthesizing full-
length cDNA having complete sequence information of full-length mRNA.

Background of the Invention 

A basic technology in the field of molecular biology is the conversion of poly(A)+RNA
(mRNA) to double-stranded (ds) complementary DNA (cDNA), which then can be inserted into 
a cloning vector for generating a cDNA library or expression in an appropriate host cell. Advances 
in cDNA library construction technology have made possible the discovery and production of a 
wide range of biologically important proteins. 

Several procedures for generating cDNA libraries which have been used during the last
15 years are comprehensively reviewed in Wu, ed. Methods in Enzymology (1987), vol. 151. For
the most part, cDNA library construction technologies use poly(A)+RNA as a starting material.
The intact poly(A)+RNA is characterized by a polyadenylated "tail" at its 3' end and a
characteristic "CAP structure or cap site" at the 5' end. A critical requirement for cDNA library
construction is to completely copy poly(A)+RNA to full-length cDNA and retain the complete
sequence information on the structure of the protein encoded by mRNA.

One generalized and commonly used method by which the poly(A)+RNA is copied into
cDNA employs reverse transcriptase, which starts at the 3' end of the mRNA from an oligo d(T)
primer and proceeds towards the 5' end to generate a cDNA:mRNA hybrid (Gubler et al., 1983).
The RNA strand is then removed from the hybrid by action of RNase H and a second DNA
strand is then synthesized using DNA polymerase I. The resulting heterogeneous mixture of ds
cDNA molecules can then be cloned into suitable recombinant DNA vector molecules using a
variety of techniques. Unfortunately, for the majority of mRNAs, this method does not allow for
the synthesis of "full-length" cDNA because reverse transcriptase cannot efficiently copy them
into full-length cDNAs. The efficiency of copying is inversely proportional to the length of
mRNA; thus, the problem of "full-length" cDNA synthesis is more acute for longer mRNAs.
Moreover, the current technology can generate deletions at the 5' and 3' ends of the cDNA.

In an alternative approach, poly(A) tails of mRNA molecules are first annealed to oligo
(dT) linked with linearized vector DNA (vector primer) (Okayama et al., 1982; Pruitt,
International Patent, Appl. No. 89110816.9). Then, the first strand of cDNA synthesized by
reverse transcriptase is tailed at the 3' end by oligo dT, which facilitates subsequent cloning by
circularization into the vector primer. This method also generates greater numbers of cDNA clones
that contain truncated cDNAs due to non-full-length cDNA synthesis.

As a result of the shortcomings of present technologies used to generate conventional
cDNA libraries, the majority of the cDNA clones lack sequences close to the 5' end of the
mRNAs. This results in a loss of important information required to make functional proteins.
Two selection procedures have been developed in efforts to enrich cDNA libraries for "full-length"
cDNA clones. In the CAP retention procedure (CAPture), cap-binding protein (eukaryotic initiation
factor 4E) in combination with RNase A was used to purify full-length cDNA:mRNA hybrids
(Edery et al., 1995; Sonenberg et al., U.S. Patent No. 5,219,989). Shorter duplexes corresponding
to non-full-length cDNA fragments are not selected, since the CAP structure of mRNA is
removed from the RNA moiety by nuclease treatment. Although the CAPture method could
potentially enrich cDNA libraries for clones containing the authentic 5' ends, the yield of full-
length cDNA is very low, especially for long cDNAs (1-5%). The low yield is a significant
disadvantage for this technology.

In the "oligo-capping" method, the CAP structure of mRNA is selectively replaced with
an oligoribonucleotide, thus generating chimeric oligonucleotide/full-length mRNA intermediates
which are subsequently used for synthesis of full-length cDNAs (Maruyama et al., 1994;
Fromont-Racine et al., 1993; Kato et al., International Patent, Publ. No. 0 625 572 A1, Appl. No.
939210613). However, this method is complicated, involving treatment of mRNA with an alkaline
phosphatase, decapping mRNA with tobacco acid pyrophosphatase, and ligation of the
oligonucleotide to the 5' end of mRNA by T4 RNA ligase. These multiple enzymatic steps
degrade mRNA, thereby generating incomplete cDNA fragments for subsequent cloning
procedures. Size distribution of cDNA inserts in cDNA libraries generated using the "oligo-
capping" method is typically less than 3 kb. This is much less than full-length mRNA size
distribution (Kato et al., 1994) and indicates the low efficiency of "full-length" cDNA cloning by
"oligo-capping" technology.

As can be understood from the foregoing, conventional methods for constructing cDNA
libraries containing full-length cDNA clones are restricted by low efficiency and the use of
multiple, time-consuming steps. Accordingly, a simple method that would generate high quality,
full-length cDNA from RNA is highly desirable.

Brief Summary of the Invention

The present invention provides an advantageous method for synthesis and cloning of full-
length cDNA, or cDNA fragments, that corresponds to the complete sequence of 5'-ends of
mRNA molecules. The method of the present invention comprises contacting RNA with a cDNA
synthesis primer which can anneal to RNA, a suitable enzyme which possesses reverse
transcriptase activity, and a template switching oligonucleotide under conditions sufficient to
permit the template-dependent extension of the primer to generate an mRNA-cDNA hybrid. The
template switching oligonucleotide hybridizes to the CAP site at the 5'-end of the RNA molecule
and serves as a short, extended template for CAP-dependent extension of the 3'-end of the ss
cDNA that is complementary to the template switching oligonucleotide. The resulting full-length
ss cDNA includes the complete 5'-end of the RNA molecule as well as the sequence
complementary to the template switching oligonucleotide, which can then serve as a universal
priming site in subsequent amplification of the cDNA.

The subject invention also concerns novel template switching oligonucleotides that can
be used according to the subject method. Kits containing the template switching oligonucleotide
are also included within the scope of the present invention.



Brief Description of the Drawings 

Figure 1 shows a representation of a mechanism of template switching technology
according to the subject invention.

Figure 2 shows a representation of a procedure using the methods and materials of the 
present invention for cloning 5'-end sequence of full-length DNA. 

Figure 3 shows a representation of PCR-based methods and materials of the present 
invention for full-length cDNA library construction according to the subject invention. 

Figure 4 shows a representation of CAPswitch full-length cDNA library construction
technology based on PRE adaptor-primer strategy.

Detailed Disclosure of the Invention 
The subject invention concerns compositions and methods for constructing cDNA
libraries from nanogram quantities of total or poly A+ RNA. The compositions and methods
employ template switching oligonucleotides described herein. The method of the present
invention comprises contacting RNA with a primer which can anneal to RNA, a reverse
transcriptase, and a template switching oligonucleotide under conditions sufficient to permit the
template-dependent extension of the annealed primer to generate an mRNA-cDNA hybrid. The
template switching oligonucleotide can hybridize to the CAP site at the 5'-end of the RNA
molecule and serve as a short, extended template for CAP-dependent extension of the 3'-end of
the ss cDNA that is complementary to the template switching oligonucleotide. The resulting full-
length ss cDNA includes the complete 5'-end of the RNA molecule, as well as the sequence
complementary to the template switching oligonucleotide. The template switching complementary
sequence can then serve as a universal priming site in subsequent amplification of the cDNA.

Specifically, the subject invention provides a method for synthesis of full-length single-
stranded (ss) cDNA, or ss cDNA fragments, from RNA. The cDNA synthesized in the present
method has an arbitrary anchor sequence at the 3' end, followed by a nucleotide sequence
complementary to the RNA molecule starting from the cap site of the mRNA. In a preferred
embodiment, the process of the subject invention comprises the following steps:

1. Incubating a sample of poly(A)+RNA or total RNA in the presence of a cDNA
synthesis primer (CDS primer) which can anneal to mRNA and an enzyme which
possesses reverse transcriptase activity under conditions sufficient to permit the
template-dependent extension of the primer to generate an mRNA-cDNA hybrid;
and

2. Incubating the first-strand cDNA synthesis mixture obtained from step 1 with a
template switching oligonucleotide of the present invention (also referred to
herein as a "CAPswitch oligonucleotide"), which can provide CAP-dependent
extension of full-length cDNA by reverse transcriptase using the template
switching oligonucleotide as a template, and thereby adding nucleotide sequence
complementary to the template switching oligonucleotide to the 3'-end of full-
length cDNA (referred to herein as "anchored cDNA:mRNA hybrid"). The
template switching oligonucleotide has a pre-selected arbitrary nucleotide
sequence at its 5'-end and at least one riboguanine residue at its 3'-end.

Steps 1 and 2 of the method are separated only in time. In a preferred embodiment, step 1 is
followed by step 2. However, it would be understood that the first-strand cDNA synthesis mixture
from step 1 can include a template switching oligonucleotide which will be used at step 2.
Alternatively, a template switching oligonucleotide can be added to the reaction mixture at the
time of or after first-strand cDNA synthesis. In a preferred embodiment, the cDNA synthesis
primer is a modified oligo(dT) primer.

The scope of the present invention also includes a method for isolating a full-length
cDNA fragment corresponding to a 5'-end of target mRNA(s) using the anchored cDNA:mRNA
hybrid generated at step 2 as a template. This method comprises the embodiment of steps 1 and
2, followed by either alternative step 3A or step 3B, described below:

3A. Incubating an anchored cDNA:mRNA hybrid generated at step 2 with a
combination of (a) an oligonucleotide primer corresponding partially or
completely to the nucleotide sequence of the template switching oligonucleotide,
(b) oligonucleotide primer(s) which is complementary to a nucleotide sequence
of the target mRNA(s), and (c) an effective amount of other reagents necessary
to perform polymerase chain reaction (PCR). The incubation is conducted under
conditions sufficient to perform PCR and thereby generate amplification product
corresponding to the 5'-end full-length fragment of target cDNA.

3B. Treating anchored cDNA:mRNA hybrid of step 2 under conditions in which a
second cDNA strand is synthesized, using the first anchored cDNA strand as a
template.

Also within the scope of the present invention is a method for generating cDNA libraries
containing full-length cDNAs. This method uses as a template the anchored cDNA:mRNA hybrid
generated at step 2. This method comprises the embodiment of steps 1 and 2, followed by either
alternative step 3C or step 3D, described below:

3C. Incubating anchored cDNA:mRNA hybrid generated at step 2 with a
combination of primers corresponding partially or completely to the sequence of
template switching oligonucleotide and cDNA synthesis primer, respectively, and
an effective amount of other reagents necessary to perform PCR. The incubation
is conducted under conditions sufficient to perform PCR to generate
amplification product corresponding to the representative library of full-length
ds cDNA.

3D. Treating anchored cDNA:mRNA hybrid of step 2 under conditions in which a
second cDNA strand is synthesized, using the first anchored cDNA strand as a
template.
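
The selection implied by step 3C can be illustrated with a small string model. This is only a sketch under stated assumptions: the function names and the toy sequences are hypothetical, primer annealing is reduced to exact string matching, and the first-strand cDNA is assumed to carry the CDS primer at its 5' end and, when template switching has occurred, the complement of the template switching oligonucleotide at its 3' end.

```python
# Toy model: which first-strand cDNAs carry both priming sites used in step 3C.
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna))


def has_both_priming_sites(first_strand: str, cds_primer: str, ts_primer: str) -> bool:
    """True if the cDNA starts with the CDS primer and ends with the anchor.

    ts_primer has the same sense as the template switching oligonucleotide,
    so it anneals to the anchor (its reverse complement) at the cDNA 3' end.
    """
    starts_with_cds = first_strand.startswith(cds_primer)
    ends_with_anchor = first_strand.endswith(reverse_complement(ts_primer))
    return starts_with_cds and ends_with_anchor


# Hypothetical toy sequences, far shorter than real primers.
cds_primer = "TTTT"
ts_primer = "AAGC"
anchored_cdna = cds_primer + "GATC" + "CCC" + reverse_complement(ts_primer)
truncated_cdna = cds_primer + "GA"  # terminated early, no anchor added

print(has_both_priming_sites(anchored_cdna, cds_primer, ts_primer))
print(has_both_priming_sites(truncated_cdna, cds_primer, ts_primer))
```

In this model only the anchored cDNA presents both sites, which mirrors the idea that prematurely terminated products lack the universal priming site.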

In one aspect of the present invention, the resulting cDNA product generated at step 2 or 3 can
be inserted into recombinant cloning vehicles, and hosts can be transformed with these vehicles
according to conventional methods well known in the art (Kimmel et al., 1987).

The subject invention enables synthesis of full-length cDNA, which has been difficult to
synthesize by conventional methods. The present invention includes the novel step 2 described
above, which can be utilized in standard cDNA preparation/cloning procedures which are well
known in the art. The use of template switching oligonucleotides in the subject method
advantageously allows for negative selection against cDNAs that are not complementary with the
5'-end of template RNA, whereas full-length cDNAs can be readily selected. Moreover, the
subject methods can significantly simplify cDNA synthesis and cloning. Since the cDNA clones
obtained from the full-length cDNA library prepared according to the present method contain the
complete information for the primary structure of the protein, the invention also relates to a
process for using the clones obtained from said full-length cDNA library to produce the encoded
proteins.

In another aspect, the invention provides methods where the resulting cDNA product
generated at step 3 can be used as a starting material for use with cDNA subtraction methods.
Specifically, the method of the subject invention can be used in conjunction with cDNA
subtraction procedures to prepare a cDNA population containing highly enriched representation
of cDNA species that are present in one DNA population (the tester population), but that are
less abundant or absent in another DNA population (the driver population). Preferably, tester
and driver ds cDNA amplified by methods and materials of the present invention is used in
combination with Suppression Subtractive Hybridization technology described by Chenchik et al.
(U.S. Patent No. 5,565,340). Other methods of subtractive hybridization, described for example by
Wigler et al. (U.S. Patent No. 5,436,142); Hampson et al. (Nucl. Acids Res. 20:2899 (1992)); Yang
et al. (Anal. Biochem. 237:109-114 (1996)); Balzer et al. (Nucl. Acids Res. 22:2853-2854 (1994)),
and others, can also be employed.

The ds cDNA prepared according to the present invention can also be used as a
hybridization probe. As used herein, the term "hybridization probe" means that cDNA generated
from total RNA isolated from healthy, diseased or infected organisms, or as subtracted as
described by Chenchik et al. (U.S. Patent No. 5,565,340), may be labeled by radioisotopes,
fluorescent and other reporter groups by conventional chemical or enzymatic labeling procedures.
Labeled cDNA can then be used in standard hybridization assays known in the art, i.e., the labeled
cDNA is contacted with the defined oligonucleotides/polynucleotides corresponding to a particular
set of the genes immobilized on a solid surface for a sufficient time to permit the formation of
patterns of hybridization on the surfaces caused by hybridization between certain polynucleotide
sequences in the hybridization probe with the certain immobilized defined
oligonucleotides/polynucleotides. The hybridization patterns can be detected using available
conventional techniques, such as scintillation counting, autoradiography, fluorescence
measurement, colorimetric measurement, or light emission measurement. Techniques and
conditions for labeling, hybridization and detection are well known in the art (see, e.g.,
Maniatis et al., 1989; Keller et al., 1993).

Labeled cDNA can also be used for identifying genes which are differentially expressed
in two different pre-determined states of an organism (see, e.g., Maniatis et al., 1989; International
Patent WO 95/21944).

The ss cDNA generated after the first-strand cDNA synthesis-template switching
procedure (step 2) or the ds cDNA generated after the 5'-RACE amplification step (step 3) can be
used as a hybridization probe for selectively recovering a desired complementary target nucleic
acid molecule from a mixture or library of ss or ds nucleic acid containing said molecule. In this
method, the hybridization probe can be haptenylated and used for obtaining an enriched source
of target nucleic acid molecule based on a hybridization approach as described in detail in U.S.
Patent No. 5,484,702 of January 16, 1996 and U.S. Patent No. 5,500,356 of March 19, 1996, or
on RecA protein-mediated hybridization as described in detail in U.S. Patent No. 4,888,274 of
December 19, 1989.

Also within the scope of the present invention are template switching or "CAPswitch"
oligonucleotides useful for the preparation of cDNA libraries containing full-length cDNA clones.
The CAPswitch oligonucleotides have at least two functions. One function is the ability to
selectively interact with full-length intermediates of reverse transcriptase-mRNA-cDNA which are
generated at the 5' end of full-length mRNA after first-strand cDNA synthesis. A second function
of the CAPswitch oligonucleotides of the subject invention is as an efficient template for reverse
transcriptase from the above-mentioned full-length intermediates, which can allow 5' mRNA end
(in most cases CAP-dependent) extension of full-length cDNA by reverse transcriptase using
CAPswitch oligonucleotide as a template. A sequence complementary to CAPswitch
oligonucleotide can thereby be added to the 3'-end of full-length cDNA.

Also within the scope of the present invention are modifications in the structure or
sequence of CAPswitch oligonucleotide which can provide an advantage for selective binding to
the CAP structure of mRNA. One modification, for example, can include covalently binding
CAPswitch oligonucleotides with a protein capable of binding the CAP structure of mRNA (see
U.S. Patent No. 5,219,984).

The invention particularly concerns the embodiments of the above methods wherein the
CAPswitch oligonucleotide is represented by the following formula:

5'-dN1-dN2-...dNm-rN1-rN2...rNn-3'
wherein dN represents a deoxyribonucleotide selected from among dAMP, dCMP, dGMP and
dTMP; m represents an integer equal to or greater than zero, preferably from 10 to 50; rN
represents a ribonucleotide selected from among AMP, CMP, GMP and UMP, preferably GMP;
and n represents an integer of at least one or greater, preferably from 3 to 7. Some obvious
modifications in the structure of the CAPswitch oligonucleotide that are well known in the art,
such as replacement of 1-10 nucleotides with random nucleotides, nucleotide analogs, or nucleotides
labeled with different hapten groups (such as, for example, biotin, digoxigenin, and fluorescein),
incorporation of a terminator nucleotide (such as, for example, 3'-amino NMP, 3'-phosphate
NMP, and 3'-fluoro NMP), using partially double-stranded DNA containing extension of a single-
stranded CAPswitch oligonucleotide sequence 5'-dN1-dN2-...dNm-rN1-rN2...rNn-3', incorporation
of restriction sites, and incorporation of promoter sequences for bacteriophage RNA polymerase,
which simplify subsequent purification and cloning of the cDNA but which still retain substantially
the same functional activity as an unmodified CAPswitch oligonucleotide, i.e., mRNA 5'-end
extension of full-length cDNA by reverse transcriptase using CAPswitch oligonucleotide as a
template, are within the scope of the present invention.
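
The general formula can be expressed as a small data structure for checking candidate designs. The following is a minimal sketch, not part of the disclosed method: the class name, property name, and notation format are hypothetical, and only the constraints stated in the formula (m of zero or more, n of at least one, preferred ranges of 10 to 50 and 3 to 7) are encoded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChimericOligo:
    """A 5'-d(N)m-r(N)n-3' oligonucleotide: DNA portion followed by RNA portion."""

    dna: str  # deoxyribonucleotide portion, letters A/C/G/T, length m >= 0
    rna: str  # ribonucleotide portion, letters A/C/G/U, length n >= 1

    def __post_init__(self) -> None:
        if set(self.dna) - set("ACGT"):
            raise ValueError("DNA portion may contain only A, C, G, T")
        if not self.rna or set(self.rna) - set("ACGU"):
            raise ValueError("RNA portion needs at least one of A, C, G, U")

    @property
    def in_preferred_range(self) -> bool:
        """True if m is 10-50 and n is 3-7, the preferred ranges in the text."""
        return 10 <= len(self.dna) <= 50 and 3 <= len(self.rna) <= 7

    def notation(self) -> str:
        """Render the oligo in the d( )r( ) style used for the listed sequences."""
        return f"5'-d({self.dna})r({self.rna})-3'"


# The 21-base arbitrary sequence appears in the listed oligonucleotides;
# pairing it with r(GGG) here is an illustrative choice.
oligo = ChimericOligo(dna="TGTAGCGTGAAGACGACAGAA", rna="GGG")
print(oligo.notation(), oligo.in_preferred_range)
```

An oligonucleotide with no ribonucleotide portion is rejected because the formula requires n to be at least one.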

Also included within the subject invention are cDNA library construction kits, for
example, library construction kits which include the novel oligonucleotides according to the
subject invention for use with PCR procedures.

The subject invention also concerns novel template switching oligonucleotides that can
be used according to the subject method.

Also included within the scope of the present invention are kits that include in one or
more containers template switching oligonucleotides of the invention.

CAPswitch technology. The methods and materials of the present invention are primarily
based on the use of unique CAPswitch oligonucleotides in cDNA synthesis. The subject cDNA
synthesis can include a step of first-strand cDNA synthesis from polyA+ RNA using reverse
transcriptase, coupled with either second-strand cDNA synthesis or PCR amplification in a second
step to generate a high yield of full-length ds cDNA. When included in the first-strand cDNA
synthesis reaction mixture, the CAPswitch oligonucleotides create a short extended template. In the
course of first-strand cDNA synthesis, the reverse transcriptase enzyme stops at the 5' end of the
mRNA template. The 5'-end typically includes a 7-methylguanosine CAP structure present on the 5'
ends of all eukaryotic mRNAs. The enzyme's terminal transferase activity then adds a few additional
nucleotides, primarily deoxycytidine, to the 3'-end of the newly synthesized cDNA strand.
The CAPswitch oligonucleotide, which has an oligo (rG) sequence at its 3' end, base pairs with
the deoxycytidine-rich stretch of nucleotides present on the cDNA strand, creating an extended
template. Reverse transcriptase then switches templates and continues synthesis of cDNA that
is complementary to the CAPswitch oligonucleotide. The resulting full-length ss cDNA
incorporates at its 3' end a sequence which is complementary to both the complete 5' end of the
mRNA and the CAPswitch oligonucleotide sequence.
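
The sequence of events described above can be traced with a toy string model. This is a sketch under stated assumptions and not a simulation of the enzymology: the function names are hypothetical, the non-templated deoxycytidine stretch is assumed to match the oligo (rG) length exactly, and the mRNA and oligonucleotide sequences are arbitrary short examples.

```python
# Toy string model of template switching; real reverse transcription is enzymatic.
RNA_TO_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}
DNA_TO_DNA = {"A": "T", "T": "A", "G": "C", "C": "G"}


def first_strand(mrna: str) -> str:
    """Return cDNA (5'->3') complementary to an mRNA written 5'->3'."""
    return "".join(RNA_TO_DNA[base] for base in reversed(mrna))


def template_switch(mrna: str, oligo_dna: str, n_rg: int = 3) -> str:
    """Return anchored cDNA after copying a 5'-d(oligo_dna)r(G)n-3' oligo.

    The dC stretch added at the cDNA 3' end pairs with the oligo (rG) tail,
    and the arbitrary DNA portion is then copied as its reverse complement.
    """
    cdna = first_strand(mrna)
    # Treat the rG tail as G so the whole oligo can be reverse-complemented:
    # the result is dC * n_rg followed by the complement of the arbitrary part.
    oligo_as_dna = oligo_dna + "G" * n_rg
    anchor = "".join(DNA_TO_DNA[base] for base in reversed(oligo_as_dna))
    return cdna + anchor


anchored = template_switch("GAUC", "AAGC", n_rg=3)
print(anchored)  # GATCCCCGCTT
```

Reading the printed result from 5' to 3', `GATC` is the copy of the mRNA, `CCC` is the deoxycytidine stretch, and `GCTT` is the complement of the arbitrary portion of the oligonucleotide.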

Identified herein is an oligonucleotide structure (CAPswitch oligonucleotide) which can
provide for an efficient template switching reaction in the course of first-strand cDNA synthesis
from poly(A)+RNA using, as a donor, 5'-capped full-length mRNA and, as the acceptor, a
chemically synthesized oligonucleotide. Chimeric cDNA products having an oligonucleotide
sequence at the 3'-end of full-length cDNA were revealed by subsequent amplification (e.g., 5'-
RACE) using a combination of a gene-specific primer and a primer complementary to a portion
of the template switching oligonucleotide. One set of oligonucleotides according to the present
invention, having an arbitrary sequence at the 5' end and random sequence at the 3' end, is
represented by the following polynucleotide sequences:

CzR1      5'-d(TGTAGCGTGAAGACGACAGAA)r(N)12-3' (SEQ ID NO. 1)

CzR2      5'-d(TGTAGCGTGAAGACGACAGAA(N)11)r(N)1-3' (SEQ ID NO. 2)

CzR3      5'-d(TGTAGCGTGAAGACGACAGAA(N)11)-3' (SEQ ID NO. 3)

CzR4      5'-d(TGTAGCGTGAAGACGACAGAAGGATG(N)9)r(N)1-3' (SEQ ID NO. 4)

Na21-N4   5'-d(TGTAGCGTGAAGACGACAGAA)r(N)4-3' (SEQ ID NO. 5)

Na21-N8   5'-d(TGTAGCGTGAAGACGACAGAA)r(N)8-3' (SEQ ID NO. 6)

Na21-N12  5'-d(TGTAGCGTGAAGACGACAGAA)r(N)16-3' (SEQ ID NO. 7)

wherein d( ) represents a deoxyribonucleotide sequence; (N)11 and (N)9 represent, respectively, a
random deoxyribonucleotide sequence 11 or 9 bases long of dAMP, dGMP, dCMP and dTMP in
each base position; r(N)1-16 represents a random sequence 1 to 16 bases long of AMP, GMP,
CMP and UMP in each base position.
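
Because each random position can hold any of four nucleotides, the number of distinct molecules in such a pool grows as 4 to the power of the number of random positions. The sketch below works through that arithmetic; the function names are hypothetical, and the enumeration is shown only for a very short tail because larger pools are impractical to list.

```python
from itertools import product

RIBONUCLEOTIDES = "ACGU"


def pool_complexity(n_random: int) -> int:
    """Number of distinct sequences when n_random positions are fully random."""
    return 4 ** n_random


def expand_pool(fixed_dna: str, n_random: int):
    """Yield (DNA portion, RNA tail) pairs for every possible random RNA tail."""
    for tail in product(RIBONUCLEOTIDES, repeat=n_random):
        yield fixed_dna, "".join(tail)


# r(N)4 and r(N)8 tails, as in the Na21-N4 and Na21-N8 designs.
print(pool_complexity(4))  # 256
print(pool_complexity(8))  # 65536

# Enumerate a deliberately tiny pool: two random ribonucleotide positions.
small_pool = list(expand_pool("TGTAGCGTGAAGACGACAGAA", 2))
print(len(small_pool))  # 16
```

Only a fraction of any such pool carries a given tail, such as all-G, which is one way to read the later finding that a defined oligo rG end is preferred.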

Based on the efficiency of amplification of 5'-ends (5'-RACE) of four human cDNAs
(smooth muscle α-actin, smooth muscle γ-actin, cytoskeletal γ-actin and transferrin receptor) and
model RNAs with and without cap structure at the 5' end, and subsequent sequence analysis of
amplified products, conservative structures have been identified at the 3' end of an oligonucleotide
which can be used to generate a highly efficient CAP-dependent template switching reaction.
Mutational analysis of conservative and non-conservative regions of the oligonucleotide sequence
(see the sequences of additional analyzed CAPswitch oligonucleotides, SEQ ID NOS. 19-66)
revealed that the highest efficiency of CAP-dependent template switching was achieved using a
basic DNA-RNA chimeric CAPswitch oligonucleotide having an arbitrary sequence at its 5' end
and a conservative oligo ribo(G) sequence at the 3' end. This oligonucleotide is represented by
the general formula:

5'-dNx-rGy-3' 

long; and rGy represents an oligo rG sequence 3-5 bases long. The oligo rG sequence is
responsible for the main template switching function associated with the CAPswitch
oligonucleotide. The arbitrary deoxyribooligonucleotide sequence can be selected so as to be
useful for subsequent cDNA synthesis and cloning steps.

In addition, the template switching reaction is most efficient for cDNAs that have
sequence complementary to the 5' end of the mRNA in comparison to cDNAs that were
prematurely terminated during reverse transcription of the first strand and, therefore, do not have
sequences complementary to the 5' end of the mRNA molecule. The presence of the CAP
structure at the 5' end of mRNA is not a necessary requirement for the template switching reaction,
but the template switching reaction is most effective for full-length DNA products synthesized on
mRNA which has the CAP structure present at the 5'-end.

Modifications in the structure of template switching oligonucleotides are contemplated
within the scope of the present invention. Specifically, those modifications which may affect the
template switching efficiency but in which the template switching oligonucleotide retains
substantially the same functional activity as unmodified template switching oligonucleotides, i.e.,
CAP-dependent extension of full-length cDNA by reverse transcriptase using the oligonucleotide
as a template, are contemplated by the subject invention. The modified oligonucleotides can be
used as alternatives to unmodified CAPswitch oligonucleotides. The following rules summarize
these modifications that are useful according to the subject invention:

1a. The use of shorter (1-2 bases) oligo rG 3'-end sequences, replacement
of one or several rG residue(s) for rA, rC or rU, or replacement of oligo rG for oligo dG
reduces efficiency of the basic structure; longer oligo rG sequences (7-9 bases) do not
significantly influence template switching efficiency.

1b. Modification of the 3' terminal G at the 3'-OH group of ribose residue
by an amino, biotin, phosphate, fluoro or glycerol group can significantly reduce
background in subsequent PCR amplification steps (step 3A).

1c. Changes in the sequence of the arbitrary portion of a template switching
oligonucleotide, partial or complete replacement of deoxyribonucleotides with
ribonucleotides, or inclusion of restriction site(s) do not significantly influence template
switching efficiency. Using a longer arbitrary sequence (about 22 to 42 bases) at the 3'-
end of the template switching oligonucleotide reduces the efficiency of template switching,
whereas shorter sequences (about 15 to 17 bases) slightly increase the efficiency of
template switching but make subsequent PCR amplification steps (steps 3A, 3C) less
efficient.

1d. A person skilled in this art having the benefit of the current disclosure
would recognize that other modifications in the structure of template switching
oligonucleotides of the present invention can be readily prepared. These modifications
can increase the efficiency and specificity of the CAP-dependent template switching
reaction. For example, using aptamer (random oligonucleotide) selection technology
(Kenan et al., 1994) it is possible to find ribonucleotide or deoxyribonucleotide sequences
of the arbitrary portion of the template switching oligonucleotide which efficiently bind
to the CAP structure of an mRNA molecule and, therefore, increase efficiency of the
template switching reaction. The same result can be achieved by replacement of natural
nucleotide(s) with modified nucleotides in order to increase the affinity of the template
switching oligonucleotides binding to the CAP structure.

Chimeric protein-template switching oligonucleotides can also be constructed so
that the protein portion recognizes and binds the CAP structure, which can increase
efficiency of the template switching reaction. These cap binding proteins or protein
portions are well known in the art and preferably include antibodies against the CAP
structure and eukaryotic initiation factor 4E (eIF-4E).



Another advantage of using the methods and compositions of the present invention is the
high flexibility of this procedure, which makes it possible to use this new technology with
conventional cDNA cloning procedures well known in the art. Advantageously, the subject invention
can eliminate the need for multiple enzymatic or purification procedures used in conventional
procedures. The subject method can provide CAP-dependent automatic and direct addition of
a template switching oligonucleotide sequence to the 5' end of mRNA:cDNA hybrid in the course
of first-strand cDNA synthesis. Moreover, the methods and compositions of the subject invention
can be combined with procedures well known in the art for cDNA synthesis and cloning. It will
be apparent to those skilled in the art that the order of some of the individual steps, the exact
structure of CDS primers, and the vectors used for cDNA library construction can be varied.
Thus, modifications, substitutions, and optimization of the methods and materials disclosed herein
which provide for the cloning of full-length cDNA or cDNA fragment(s) containing complete
complementary nucleotide sequence of the 5'-end of mRNA molecules are within the scope of
the present invention. The description below details only preferred steps which result in the
efficient generation of full-length cDNA from RNA.

First-strand cDNA synthesis. Using the subject method with conventional procedures,
first-strand cDNA synthesis can be carried out using an annealed complex comprising a cDNA
synthesis primer:mRNA as a template for reverse transcription. Primer extension can be catalyzed
by reverse transcriptase, or by a DNA polymerase possessing reverse transcriptase activity, in the
presence of adequate amounts of other components necessary to perform the reaction, for
example, deoxyribonucleoside triphosphates dATP, dCTP, dGTP and dTTP, Mg2+, and an optimal
buffer. A variety of DNA polymerases possessing reverse transcriptase activity can be used for the
first-strand cDNA synthesis. Examples of DNA polymerases that can be used in the methods of
the present invention include the DNA polymerases derived from organisms such as thermophilic
bacteria and archaebacteria, retroviruses, yeast, Neurospora, Drosophila, primates and rodents.
Preferably, the DNA polymerase is isolated from Moloney murine leukemia virus (M-MLV) (U.S.
patent 4,943,531) or M-MLV reverse transcriptase lacking RNaseH activity (U.S. patent
5,405,776), human T-cell leukemia virus type I (HTLV-I), bovine leukemia virus (BLV), Rous
sarcoma virus (RSV), human immunodeficiency virus (HIV) or Thermus aquaticus (Taq) or
Thermus thermophilus (Tth) (U.S. patent 5,322,770). These DNA polymerases may be isolated
from an organism itself or, in some cases, obtained commercially. DNA polymerases useful with
the subject invention can also be obtained from cells expressing cloned genes encoding the
polymerase. As a starting material for cDNA synthesis, poly(A)+RNA or total RNA from yeast
and higher organisms such as plants or animals can be used.

The first-strand cDNA synthesis step of the subject method can include template
switching oligonucleotides of the present invention in the reaction mixture, but they are not a
necessary component for carrying out first-strand cDNA synthesis. The template switching
oligonucleotides can instead be added at the time of the template switching step which follows
the first-strand synthesis step. Thus, it is understood that template switching oligonucleotide
molecules can be included in the first-strand reaction composition (for example, during cDNA
synthesis primer annealing to RNA or when contacting the RNA with an enzyme possessing
reverse transcriptase activity), or the oligonucleotides can be added in the course of, or after
completion of, the first-strand cDNA synthesis reaction.

Depending on the strategy to be employed for cDNA cloning, numerous cDNA synthesis
primer structures can be used for the first-strand cDNA synthesis using poly(A)+RNA as a
template and catalyzed by the reverse transcriptase activity. The cDNA synthesis primer can be
a single-stranded oligonucleotide, a double-stranded oligonucleotide having a single-stranded
portion (primer-restriction-end, or PRE, adapter, as described by Coleclough et al., 1985), or a
vector primer, representing a ds vector with a single-stranded portion (Okayama et al., 1982). In
all three cases, a single-stranded portion of the cDNA synthesis primer is responsible for binding
with poly(A)+RNA and initiating the first-strand cDNA synthesis. In a preferred embodiment
of the subject method, for full-length cDNA library construction, a CDS primer containing an
oligo dT tail at the 3'-end of the primer is annealed to the poly(A) portion of mRNA. For rapid
amplification or cloning of 5' cDNA ends and for selective cloning of particular genes, the CDS
primer can possess a random sequence or an arbitrary sequence which may correspond to a
particular sequence of a target gene which is to be cloned.

The subject invention particularly concerns the embodiments of the above methods
wherein the CDS primer can be annealed to:

1a. The poly(A) tail of poly(A)+RNA. The cDNA synthesis primer can be
selected from a single-stranded oligonucleotide, any partially double-stranded DNA
fragment, or any linear vector primer. In a preferred embodiment, the oligonucleotide
primer has the sequence: 5'-dN1-dN2-...dNm-dTn-dN1-dN2-...dNp-3', wherein m represents
an integer 0 and above, preferably from 0 to 20; n represents an integer 8 and above,
preferably from 8 to 30; p is preferably from 0 to 3; dN represents a deoxyribonucleotide
selected from, or represents a mixture of, dAMP, dCMP, dGMP, and dTMP; and dT represents
dTMP. Modifications in the structure of the primer, such as replacement of 1-10
nucleotides with nucleotides containing different hapten groups (biotin, digoxigenin,
fluorescein, etc.), nucleotide analogs, ribonucleotides, or non-natural nucleotides, or
incorporation of restriction sites or a bacteriophage RNA polymerase promoter region, which
simplify subsequent purification, use, and cloning of cDNA but still retain the main
function of the primer, i.e., priming activity from the poly(A) portion of poly(A)+RNA, are
within the scope of the present invention. The use of partially double-stranded DNA primers or
linear plasmid vectors having a single-stranded tail sequence 5'-dN1-dN2-...dNm-dTn-dN1-
dN2-...dNp-3', described above, and possessing priming activity for first-strand cDNA
synthesis from the poly(A) portion of poly(A)+RNA, is also considered part of the
subject invention. In order to simplify the cloning procedure, the CAPswitch
oligonucleotide can be attached to the other end of the vector primer. In this case, the vector
primer will possess at one end a sequence corresponding to the CDS primer and, at the
other end, the CAPswitch oligonucleotide (CAPswitch-vector primer technology).
Subsequent cDNA synthesis and automatic template switching will generate a cDNA:vector
chimeric product, which can be easily cloned as described by Okayama et al., 1982.

1b. An inner, non-poly(A) portion of the mRNA. These oligonucleotide
primer(s) have the general formula dN1-dN2-...dNq, where dN represents a
deoxyribonucleotide selected from among dAMP, dCMP, dGMP, and dTMP or represents
a mixture of 2-4 of these bases; and q represents an integer 6 and above, preferably from 6
to 50. These primers can have a random sequence, i.e., one that anneals to all mRNAs; an
arbitrary sequence, i.e., one that anneals to at least one arbitrary mRNA; or a sequence
complementary to at least one mRNA. Also, the sequence of these primer(s) can include
restriction site(s) or modified bases (for example, biotinylated bases) to facilitate subsequent
purification or cloning procedures.
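
The general formula in 1a can be read as a simple structural test on a candidate primer. The following is a minimal sketch in Python, not part of the disclosed method: the function name and the example primer are hypothetical, and the limits used (m up to 20, n from 8 to 30, p up to 3) are the preferred ranges stated above, applied here as hard limits purely for illustration.

```python
def fits_cds_primer_formula(seq, m_max=20, n_min=8, n_max=30, p_max=3):
    """Return True if seq can be read as 5'-(dN)m-(dT)n-(dN)p-3'.

    m, n and p are checked against the preferred ranges given in the text;
    treating them as hard limits is an assumption made for this sketch.
    """
    seq = seq.upper()
    for n in range(n_min, n_max + 1):
        for start in range(0, len(seq) - n + 1):
            m = start                    # nucleotides 5' of the oligo(dT) block
            p = len(seq) - start - n     # nucleotides 3' of the oligo(dT) block
            if m <= m_max and p <= p_max and seq[start:start + n] == "T" * n:
                return True
    return False


# Hypothetical primer: a 19-nt 5' tail, 30 dT residues, and a 2-nt 3' anchor.
example_primer = "TCTAGAATTCAGCGGCCGC" + "T" * 30 + "GA"
print(fits_cds_primer_formula(example_primer))        # has a 30-nt oligo(dT) block
print(fits_cds_primer_formula("ACGTACGTACGTACGT"))    # no oligo(dT) block of 8 or more
```

A primer of type 1b would need no such test, since its formula constrains only the length q.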

Second-strand cDNA synthesis and/or PCR amplification. First-strand cDNA synthesis
based on CAPswitch technology generates full-length mRNA:cDNA hybrid intermediates (or
fragments corresponding to the 5' end of the full-length molecule) flanked by a CDS primer at the
3' end and a CAPswitch oligonucleotide at the 5' end. Such intermediates can be easily converted
to a ds cDNA form suitable for subsequent cloning using conventional procedures. These
procedures are well known and include:

1. Direct amplification of full-length cDNA by combination of PCR primers corresponding
to the CDS and CAPswitch flanking portions of the mRNA:cDNA hybrid and an effective
amount of other reagents under conditions necessary to perform PCR. Preferably, the
conditions are those developed for amplification of long nucleic acid sequences and described
by Cheng (International Patent (1995)) and Barnes (U.S. Patent No. 5,436,149 of July 25, 1995).

2. Replacement of the mRNA portion of the mRNA:cDNA hybrid with a second-strand
cDNA essentially as described by Okayama et al. (1982) and Gubler et al. (1983). This
process entails digestion of the RNA with a ribonuclease such as E. coli RNase H, repair
synthesis using a DNA polymerase having the activities of DNA polymerase I, and
ligation. The procedure depends on the structure of the CDS primer used for the first-strand
cDNA synthesis. Second-strand cDNA synthesis can be carried out using as a
template the mRNA:cDNA hybrid, or the mRNA:cDNA-vector chimeric product obtained
when a vector primer is used for first-strand synthesis (CAPswitch-vector primer technology,
see above). Alternatively, the mRNA:cDNA hybrid generated by the PRE adaptor strategy,
described above, can be digested at the 5' and 3' flanking sequences, which correspond to the
PRE adaptor and CAPswitch oligonucleotide, by at least one restriction enzyme, and then
ligated into a conventional vector digested by the same restriction enzyme(s). Any restriction
enzyme(s) can be used as long as it does not cut within the mRNA:cDNA hybrid.

Cloning into vector. In the case of using a vector primer (CAPswitch-vector primer) or
PRE adaptor strategy, the ds cDNA generated after second-strand cDNA synthesis is already
inserted into the vector and does not require this step. When an oligonucleotide CDS primer is
used, the ds cDNA prepared in the second step by PCR or by mRNA replacement technology can
be ligated with adaptors or digested with restriction enzyme(s) in sequences corresponding to the
CDS and CAPswitch oligonucleotide flanking portions, thus generating ds cDNA molecules which
will be ligated to any conventional cloning vector (including plasmid, cosmid, phage, retroviral
vector and so on) after digesting it with the same restriction enzyme(s).

Then, recombinant DNA molecules comprising a full-length cDNA library can be
introduced into prokaryotic hosts and, optionally, eukaryotic hosts, useful in the high frequency
cloning of full-length ds cDNA and in the expression of recombinant proteins therefrom.

Once cloning is completed according to the invention, the desired clone(s) can be
detected by labeled probe, or by monoclonal or polyclonal antibodies prepared against the
product in a conventional immunoassay, or enriched for the desired target by a hybridization
selection approach, described for example by Li et al. in International Patent WO 95/04745 of
9 August 1994.

Summary. Use of the CAPswitch oligonucleotide in cDNA synthesis and cloning significantly
simplifies and improves the technology of full-length cDNA library construction. The main
benefits are as follows:

1. The one-stage procedure, which includes first-strand cDNA synthesis and addition of a
defined sequence to the 3' end of cDNA, significantly reduces the number of steps
(from 5-7 to 2-3 steps) relative to conventional PCR-based cDNA library
construction technology. A lower number of steps means that the novel methods and
materials of the present invention are more efficient, easier, less labor-intensive, and more
reproducible than conventional cDNA library construction methods.

2. In accordance with the present invention, the CAP-dependent template switching
mechanism provides significantly more efficient technology for synthesizing full-length
cDNA and generating cDNA libraries mostly containing full-length cDNAs. The methods
and materials of the present invention can provide a novel method for readily selecting
the full-length cDNAs to be cloned in the cDNA library.
As used herein, the term "full-length complementary DNA or cDNA" is defined as a
full-length single-stranded (ss) or double-stranded (ds) cDNA(s) or cDNA fragment(s) which
contain the complete sequence information of the 5'-end(s) of mRNA(s). Full-length cDNA can
contain the complete sequence information of the 5'-end of a particular (target) mRNA or of the
whole population of polyA+RNA used as a template for the first-strand cDNA synthesis.

As used herein, the term "full-length cDNA library" refers to the whole population of
ss or ds cDNAs synthesized from polyA+RNA as a template. The full-length cDNA library can
be used directly for different applications known in the art or it can be cloned into any suitable
recombinant cloning vehicle, and a host can be transformed with the cloning vehicle.

The term "template switching" reaction refers to a process of template-dependent
synthesis of the complementary strand by a DNA polymerase using two templates in consecutive
order which are not covalently linked to each other by phosphodiester bonds. The
synthesized complementary strand will be a single continuous strand complementary to both
templates. Typically, the first template is polyA+RNA and the second template is a template
switching or "CAPswitch" oligonucleotide.
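
The definition of template switching can be illustrated with a toy string model. This is a minimal sketch, not a model of the enzymology: it only encodes the statement that the product is one continuous strand complementary to both templates, read in consecutive order. The function names and both example sequences are hypothetical.

```python
# Map RNA or DNA template bases to the DNA base that pairs with them.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement_as_dna(template):
    """Return the DNA strand (5'->3') complementary to an RNA or DNA template (5'->3')."""
    return template.upper().translate(_COMPLEMENT)[::-1]


def template_switch_product(first_template, second_template):
    """Model a strand copied from the first template and then, without any
    covalent link between the templates, continued on the second template.

    The polymerase reads each template 3'->5', so the copy of the second
    template ends up at the 3' end of the product.
    """
    return (reverse_complement_as_dna(first_template)
            + reverse_complement_as_dna(second_template))


# Hypothetical short mRNA with a poly(A) tail, and a hypothetical switch oligonucleotide.
mrna = "GGACUCAGAAAAAAAA"
switch_oligo = "CTAATACGGG"
print(template_switch_product(mrna, switch_oligo))
```

In this model the 3' end of the product carries the complement of the second template, which is what makes the first-strand cDNA "anchored" in the sense defined below.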

As used herein, the term "arbitrary sequence" or "anchor sequence" refers to any defined
or pre-selected deoxyribonucleotide, ribonucleotide or mixed deoxyribo/ribonucleotide sequence
which contains a particular sequence of natural or modified nucleotides.

As used herein, the term "random sequence" is defined as a deoxyribonucleotide,
ribonucleotide or mixed deoxyribo/ribonucleotide sequence which contains in each nucleotide
position any natural or modified nucleotide.

As used herein, the term "full-length anchored cDNA" is defined as a full-length ss cDNA
which has a defined arbitrary sequence at the 3'-end.

As used herein, the term "mRNA:cDNA hybrid" refers to a product after first-strand
cDNA synthesis catalyzed by reverse transcriptase using polyA+RNA as a template. An
"mRNA:cDNA hybrid" can be full-length if the cDNA portion includes the complete sequence
of the 5'-ends of the template mRNA.

As used herein, the term "reverse transcriptase" is defined as any DNA polymerase
possessing reverse transcriptase activity which can be used for first-strand cDNA synthesis using
polyA+RNA or total RNA as a template.

As used herein, two sequences are said to be "complementary" to one another if they are
capable of hybridizing to one another to form an antiparallel, double-stranded nucleic acid
structure.

As used herein, the term "hapten" refers to a molecule that is bound to a nucleic acid
molecule and can be recognized and bound by another molecule, or "binding ligand," e.g., an
antibody, streptavidin, biotin, a transition metal, and others known in the art. Examples of
haptens include chelating groups, biotin, fluorescein, digoxigenin, antigens, and others known in
the art.

As used herein, the term "solid support" refers to any known substrate which can be used
for the immobilization of a binding ligand or oligonucleotide/polynucleotide sequences by any
known method.

As used herein, the term "reporter group" refers to any group incorporated into
full-length cDNA by conventional chemical or enzymatic labeling procedures and which can be
detected by use of conventional techniques, such as scintillation counting, autoradiography,
fluorescence measurement, colorimetric measurement, light emission measurement, and other
means known in the art. Examples of reporter groups include radioisotopes, fluorescent,
chemifluorescent, chemiluminescent, and hapten groups, and others known in the art.

As used herein, the term "subtractive hybridization" refers to technology which allows for
the recovery of a cDNA population containing a highly enriched representation of the cDNA
species that are present in one cDNA population (tester), but that are less abundant or not found
in another cDNA population (driver). Examples of subtractive hybridization technologies include
Suppression Subtractive Hybridization technology (Chenchik et al., U.S. Patent No. 5,565,340);
representation difference analysis (Wigler et al., U.S. Patent No. 5,436,142); and linker capture
subtraction (Yang et al., Anal. Biochem. 237:109-114 (1996)).

As used herein, the term "nucleotide analog" refers to a nucleotide which is not typically
found in DNA or RNA and possesses some additional features which can improve the
efficiency of the template switching reaction or improve the usage of the anchored cDNA generated.
For example, suitable nucleotide analogs include those with modifications in the base or
sugar-phosphate backbone, such as peptide nucleic acid, inosine, 5-nitroindole deoxyribofuranosyl,
5-methyldeoxycytosine, and 5,6-dihydro-5,6-dihydroxydeoxythymidine. Other nucleotide analogs
will be evident to those skilled in the art.

Following are examples which illustrate procedures for practicing the invention. These 
examples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions are by volume unless otherwise noted. 

Example 1 - Preferred Method for Cloning 5'-end Sequences of Full-Length cDNA Based on
CAPswitch Technology

Obtaining a full-length cDNA is one of the most important, and often one of the most
difficult, tasks in characterizing genes. Traditional methods for cDNA library construction usually
produce only partial cDNA fragments. To facilitate recovery of the rest of the coding sequence,
an in vitro method for the rapid amplification of cDNA ends (RACE) was proposed in 1988
(Frohman et al., 1988). In spite of various modifications which have been developed, the current
RACE technologies are complicated and inefficient. The present invention provides a method
which significantly simplifies the 5'-RACE procedure and makes it more efficient.
The flow chart which describes the CAPswitch-based 5'-RACE procedure is shown in
Figure 2, and the preferred protocol is described below. Some obvious modifications in the protocol,
e.g., using other enzymes possessing similar enzymatic activities for the first-strand synthesis step and
PCR, using other sequences of oligo(dT) primer and CAPswitch oligonucleotide, or using for
first-strand synthesis a gene-specific primer instead of an oligo(dT) primer, all fall within the scope of
the present invention.

Step 1. First-strand cDNA synthesis-template switching procedure. 10 pmol of cDNA
synthesis primer (oligo d(T) primer)

CDS1: 5'-d(TCTAGAATTCAGCGGCCGC(T)30VN)-3' (SEQ ID NO. 8)

(where V = G or A or C; N = G or A or T or C)
and 50 pmol of CAPswitch oligonucleotide (CSO1):

CSO1: 5'-d(CTAATACGACTCACTATAGGGC)r(GGGp)-3' (SEQ ID NO. 9)

(where p is a 3'-phosphate group)

were annealed to 1 µg of human placenta poly(A)+ RNA (CLONTECH Laboratories, Inc., Palo
Alto, CA), in a volume of 5 µl of deionized water, by heating the mixture for 2 minutes at 70°C,
followed by cooling on ice for 2 minutes. First-strand cDNA synthesis was then initiated by mixing
the annealed primer-RNA with 200 units of M-MLV RNase H- reverse transcriptase (Superscript
II reverse transcriptase, Life Technologies) in a final volume of 10 µl, containing 50 mM Tris-HCl
(pH 8.3 at 22°C); 75 mM KCl; 6 mM MgCl2; 1 mM DTT; and 1 mM each of dATP, dGTP, dCTP,
and dTTP. The first-strand cDNA synthesis-template switching reaction was incubated at 42°C for
1.5 hours in an air incubator and then cooled on ice. We also synthesized first-strand cDNA using
random d(N)6 primers (500 ng) or human beta-actin antisense gene-specific primer:

ACT1: 5'-d(ACTCGTCATACTCCTGCTTGCTGATCCACATCTGC)-3' (SEQ ID NO. 10)

or human transferrin receptor antisense gene-specific primer:

TFR1: 5'-d(GTCAATGTCCCAAACGTCACCAGAGA)-3' (SEQ ID NO. 11)

instead of the oligo d(T) primer.

The reaction mixture was then diluted 500-fold by addition of 5 ml of 10 mM Tricine-KOH
(pH 8.5 at 22°C) and 0.1 mM EDTA, incubated at 94°C for 1.5 min, cooled on ice, and
stored at -20°C.
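
The stated 500-fold dilution can be checked from the volumes given in Step 1 (a 10 µl first-strand reaction diluted with 5 ml of buffer). The following is a minimal arithmetic sketch in Python; the function names are hypothetical, and the estimate of residual primer concentration assumes that none of the 10 pmol of CDS primer was consumed, which is an upper bound made only for illustration.

```python
def dilution_factor(sample_ul, diluent_ul):
    """Fold dilution obtained by adding diluent_ul of buffer to sample_ul of sample."""
    return (sample_ul + diluent_ul) / sample_ul


def concentration_nM(amount_pmol, volume_ul):
    """Concentration in nM of amount_pmol dissolved in volume_ul (1 pmol/µl = 1 µM = 1000 nM)."""
    return amount_pmol / volume_ul * 1000.0


reaction_ul = 10.0      # final volume of the first-strand reaction
buffer_ul = 5000.0      # 5 ml of Tricine-KOH/EDTA buffer

factor = dilution_factor(reaction_ul, buffer_ul)
residual_primer_nM = concentration_nM(10.0, reaction_ul + buffer_ul)  # upper bound, see lead-in

print(f"dilution: {factor:.0f}-fold")
print(f"CDS primer after dilution: at most {residual_primer_nM:.1f} nM")
```

The computed factor is 501, which the text rounds to 500.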

Step 2. 5'-RACE. PCR amplification was performed using the Advantage KlenTaq
Polymerase Mix (CLONTECH Laboratories, Inc.). This kit contains a mixture of KlenTaq-1 and
Deep Vent DNA polymerases (New England BioLabs) and TaqStart antibody (CLONTECH
Laboratories, Inc.). The TaqStart antibody provides automatic hot-start PCR. Amplification was
conducted in a 50-µl volume containing 5 µl of diluted first-strand cDNA; 40 mM Tricine-KOH
(pH 9.2 at 22°C); 3.5 mM Mg(OAc)2; 10 mM KOAc; 75 µg/ml BSA; 200 µM each of dATP,
dGTP, dCTP, and dTTP; 0.2 µM each of CAPswitch primer (CSP1):

CSP1: 5'-d(CTAATACGACTCACTATAGGGC)-3' (SEQ ID NO. 12)

and gene-specific primer (GSP1 for beta-actin or transferrin receptor); and 1 µl of 50x KlenTaq
Polymerase Mix. Temperature parameters of the PCR reactions were as follows: 1 minute at 94°C,
followed by 5 cycles of 94°C for 30 seconds and 72°C for 5 minutes; then 5 cycles of 94°C for
30 seconds and 70°C for 5 minutes; then 25 cycles of 94°C for 30 seconds and 68°C for 5 minutes;
followed by a 10-minute final extension at 68°C. PCR products were examined on 1.2%
agarose/EtBr gels in 1x TBE buffer.
As a DNA size marker we used a 1 kb DNA Ladder (Life Technologies).
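
The stepped-down annealing/extension program described in Step 2 can be written out as data, which makes the cycle count and total programmed hold time easy to verify. This is a minimal sketch in Python; the data layout and function name are hypothetical, and ramp times between temperatures are ignored.

```python
# Each stage is (number_of_cycles, [(temperature_C, seconds), ...]).
RACE_PROGRAM = [
    (1,  [(94, 60)]),               # initial denaturation
    (5,  [(94, 30), (72, 300)]),
    (5,  [(94, 30), (70, 300)]),
    (25, [(94, 30), (68, 300)]),
    (1,  [(68, 600)]),              # final extension
]


def total_hold_seconds(program):
    """Sum of all programmed hold times, excluding thermal cycler ramp times."""
    return sum(cycles * sum(seconds for _, seconds in steps)
               for cycles, steps in program)


# Amplification cycles are the stages that contain more than one step.
amplification_cycles = sum(cycles for cycles, steps in RACE_PROGRAM if len(steps) > 1)
total_seconds = total_hold_seconds(RACE_PROGRAM)

print(amplification_cycles, "amplification cycles")
print(total_seconds / 60, "minutes of programmed hold time")
```

Under these assumptions the program comprises 35 amplification cycles and about 203.5 minutes of hold time.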

Both the human beta-actin and the transferrin receptor cDNA 5'-RACE reactions generated a
single band corresponding to the expected size of the full-length amplified 5'-RACE product.
Subsequent cloning and sequence analysis of 18 randomly picked 5'-RACE clones confirmed their
identity to beta-actin and transferrin receptor 5'-end fragments. Moreover, the 5'-end sequences of
the amplified 5'-RACE products exactly correspond to the sequences of full-length beta-actin and
transferrin receptor mRNAs followed by sequences corresponding to the CAPswitch oligonucleotide.
This example illustrates that CAPswitch 5'-RACE can be efficiently used not only for
amplification of full-length 5'-end sequences of cDNAs but also for exact mapping of
transcriptional start sites.
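
Because each 5'-RACE clone carries the CAPswitch sequence immediately next to the copied mRNA 5' end, the junction can be located in a sequence read by a simple string search. The following is a minimal sketch in Python under several assumptions: the read is given in the mRNA-sense orientation, the CAPswitch sequence is as we read it from the printed CSO1 oligonucleotide with the r(GGG) portion written as DNA, matching is exact, and the function name and example read are hypothetical.

```python
CAPSWITCH_DNA_PART = "CTAATACGACTCACTATAGGGC"   # DNA portion of CSO1 as read from the text
CAPSWITCH_RIBO_PART = "GGG"                      # r(GGG) portion, written as DNA


def mrna_start_in_read(read):
    """Return the index of the first base after the CAPswitch sequence, or None.

    Caveat: if the mRNA itself begins with G, this exact-match approach cannot
    tell an mRNA-derived G from the oligonucleotide-derived GGG.
    """
    anchor = CAPSWITCH_DNA_PART + CAPSWITCH_RIBO_PART
    position = read.upper().find(anchor)
    if position == -1:
        return None
    return position + len(anchor)


# Hypothetical read: 3 vector bases, the CAPswitch sequence, then the cDNA insert.
example_read = "NNN" + CAPSWITCH_DNA_PART + CAPSWITCH_RIBO_PART + "ACCGCCGAGA"
start = mrna_start_in_read(example_read)
print(example_read[start:])
```

A real analysis would normally tolerate sequencing errors, for example by using an alignment tool instead of an exact search.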

Example 2 - CAPswitch PCR-Based Technology for Full-Length cDNA Library Construction

Methods and materials of the present invention can be effectively used for construction
of cDNA libraries using as a template 10-100 ng of total RNA. Any conventional procedure well
known in the art can be used to purify this small amount of total RNA from 10-50 mg of "difficult"
cells or tissues, such as human biopsy tissues, pathogenic microorganisms, tissues at different
developmental stages and so on. The flow chart in Figure 3 shows the main steps of this procedure.
It will be apparent to those skilled in the art that some individual non-essential steps, and the
structure of the CDS primer, CAPswitch oligonucleotide and adaptors shown in Figure 3, can be
varied without changing the efficiency of the whole procedure. For example, instead of the adaptor
ligation step (step 3), ds cDNA generated by PCR can be digested by rare cutting restriction
endonuclease(s) in sequences corresponding to the CDS and CAPswitch oligonucleotide flanking
portions and cloned directly into a vector. Also, other conventional procedures well known in the
art for direct cloning of PCR products, such as TA-cloning vectors, blunt end ligation, and the
like, can be used for cloning and generation of CAPswitch full-length cDNA libraries. Any such
variations in the preferred protocol which are based on using methods and materials of the
subject invention are within the scope of the invention.

Step 1. First-strand synthesis - template switching. 10 pmol of cDNA synthesis primer
(oligo d(T) primer) CDS1:

5'-d(TCTAGAATTCAGCGGCCGC(T)30VN)-3' (SEQ ID NO. 8)

(where V = G or A or C; N = G or A or T or C) and 10 pmol of CAPswitch oligonucleotide
(CSO2):

CSO2: 5'-d(CTAATACGACTCACTATAGGGC)r(GGG)-3' (SEQ ID NO. 13)

were annealed to 100 ng of human skeletal muscle Total RNA (CLONTECH Laboratories, Inc.)
in a volume of 5 µl of deionized water by heating the mixture for 2 minutes at 70°C, followed
by cooling on ice for 2 minutes. First-strand cDNA synthesis was then initiated by mixing the
annealed primer-RNA with 200 units of M-MLV RNase H- reverse transcriptase (Superscript
II reverse transcriptase, Life Technologies) in a final volume of 10 µl, containing 50 mM Tris-HCl
(pH 8.3 at 22°C); 75 mM KCl; 6 mM MgCl2; 1 mM DTT; and 1 mM each of dATP, dGTP, dCTP,
and dTTP. The first-strand cDNA synthesis-template switching reaction was incubated at 42°C for
1.5 hours in an air incubator and then cooled on ice.

Step 2. Generation of full-length cDNA by PCR. PCR amplification of full-length cDNA
was performed using the Advantage KlenTaq Polymerase Mix (CLONTECH Laboratories, Inc.).
Amplification was conducted in a 100-µl volume containing 2 µl of first-strand cDNA; 40 mM
Tricine-KOH (pH 9.2 at 22°C); 3.5 mM Mg(OAc)2; 10 mM KOAc; 75 µg/ml BSA; 200 µM each
of dATP, dGTP, dCTP, and dTTP; 0.2 µM each of CAPswitch primer (CSP1) and CDS1 primer;
and 1 µl of KlenTaq Polymerase Mix. Temperature parameters of the PCR reactions were as
follows: 1 minute at 95°C, followed by 20-22 cycles of 95°C for 15 seconds and 68°C for 5 minutes;
followed by a 10-minute final extension at 68°C. PCR products were examined on 1.2%
agarose/EtBr gels in 1x TBE buffer. As a DNA size marker we used a 1 kb DNA Ladder (Life
Technologies).

Step 3. Adaptor ligation. The 50 µl of ds cDNA generated at the PCR step were
combined with 2 µl of Proteinase K (2 mg/ml) and incubated at 45°C for 1 hour, followed by a
denaturation step at 70°C for 10 minutes. Then, 3 µl (15 units) of T4 DNA polymerase were
added to the reaction mixture, which was incubated for an additional 30 minutes at 16°C. The ds
cDNA was then precipitated by addition of a half volume of 4 M ammonium acetate (about 35 µl)
and 3.7 volumes of 95% ethanol (about 260 µl). After vortexing, the tube was immediately
centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 minutes. The pellet was washed with 80%
ethanol without vortexing, centrifuged as above for 10 minutes, air dried, and dissolved in 16 µl of
deionized water. The ds cDNA was then ligated to an adaptor overnight at 16°C under the
following conditions: 16 µl of ds cDNA solution, 50 mM Tris-HCl (pH 7.8 at 22°C), 10 mM
MgCl2, 1 mM DTT, 1 mM ATP, 5% polyethylene glycol (M.W. 8,000), 1 µM of adaptor (Ad1):

Ad1: 5'-d(AATTCGCGGCCGCGTCGAC)-3' (SEQ ID NO. 14)
     3'-d(GCGCCGGCGCAGCTGp)-5' (SEQ ID NO. 15)

(where p = a 3'-phosphate group) and 1 unit of T4 DNA ligase (Life Technologies) in a total
volume of 30 µl. The ligation reaction was then stopped by addition of 70 µl of 10 mM EDTA.
The ds cDNA was extracted once with phenol/chloroform/isoamyl alcohol (25:24:1, vol/vol), once
with chloroform/isoamyl alcohol (24:1, vol/vol), and then precipitated by addition of 10 µl of 3
M sodium acetate and 250 µl of 95% ethanol. After vortexing, the tube was immediately
centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 minutes. The pellet was washed with 80%
ethanol without vortexing, centrifuged as above for 10 minutes, air dried, and dissolved in 20 µl
of deionized water. The adaptor-ligated ds cDNA was then phosphorylated at 37°C for 30 minutes
under the following conditions: 20 µl of adaptor-ligated ds cDNA solution, 50 mM Tris-HCl (pH
7.8 at 22°C), 10 mM MgCl2, 1 mM DTT, 1 mM ATP, and 30 units of T4 polynucleotide kinase
(Epicentre Technologies) in a final volume of 30 µl. The phosphorylation reaction was then
terminated by adding 2 µl of 0.2 M EDTA and heat inactivated at 70°C for 15 minutes.
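
The two strands of the Ad1 adaptor must pair to leave a blunt end for ligation to the cDNA and a single-stranded overhang for cloning into the vector arms. The following is a minimal sketch in Python that checks this pairing. The strand sequences used are our reading of the printed adaptor (an EcoRI-compatible overhang followed by NotI and SalI sites) and should be treated as an assumption; the function names are hypothetical.

```python
_PAIR = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement, without reversing the strand."""
    return seq.upper().translate(_PAIR)


def single_stranded_overhang(upper_5to3, lower_3to5):
    """Return the unpaired 5' part of the upper strand.

    The lower strand is written 3'->5' (as printed in the text) and is aligned
    to the 3' end of the upper strand, so that end of the duplex is blunt.
    """
    paired_length = len(lower_3to5)
    duplex_part = upper_5to3[len(upper_5to3) - paired_length:]
    if complement(duplex_part) != lower_3to5.upper():
        raise ValueError("strands are not complementary over the paired region")
    return upper_5to3[:len(upper_5to3) - paired_length]


AD1_UPPER = "AATTCGCGGCCGCGTCGAC"   # 5'->3', assumed reading of SEQ ID NO. 14
AD1_LOWER = "GCGCCGGCGCAGCTG"       # 3'->5', assumed reading of SEQ ID NO. 15

print(single_stranded_overhang(AD1_UPPER, AD1_LOWER))
```

With these sequences the unpaired part is the four-base AATT overhang that matches EcoRI-cut vector arms.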

Step 4. cDNA size fractionation and cloning. Phosphorylated adaptor-ligated ds cDNA
generated at the previous step was fractionated on a 1.2 ml Sephacryl S500 (Pharmacia) gel
filtration column equilibrated with 10 mM Tris-HCl (pH 7.4), 30 mM NaCl, 0.5 mM EDTA. The size
distribution of cDNA in the fractions was analyzed on a 1.1% agarose/EtBr gel alongside a 1 kb
DNA size marker (Life Technologies). Fractions corresponding to cDNA sizes longer than 0.5 kb
were pooled together (total volume 250 µl) and precipitated by adding 1/10 volume (25 µl) of 3
M sodium acetate, 1.5 µl of 20 mg/ml glycogen and 2.5 volumes (400 µl) of 95% ethanol.

After vortexing, the tube was immediately centrifuged at 14,000 r.p.m. in a microcentrifuge
for 20 minutes. The pellet was washed with 80% ethanol without vortexing, centrifuged as above
for 10 minutes, air dried, and dissolved in 15 µl of deionized water. The ds cDNA was then
ligated to the λgt11 EcoRI vector arms (CLONTECH Laboratories, Inc.) overnight at 16°C under
the following conditions: 5 µl of ds cDNA solution, 50 mM Tris-HCl (pH 7.8 at 22°C), 10 mM
MgCl2, 1 mM DTT, 1 mM ATP, 5% polyethylene glycol (M.W. 8,000), 2.5 µg of λgt11 EcoRI
arms, and 2.5 units of T4 DNA ligase (Life Technologies) in a total volume of 25 µl. The ligation
mixture was then packaged using standard protocols as described in the laboratory manual by
Sambrook et al. (1989).

In order to confirm the high quality of the library generated using the methods and
compositions of the present invention, 50 recombinant phage clones were selected at random for
the determination of insert size. The size distribution of the inserts was in the range of 0.5-4.5
kb with a maximum at 2.0-3.0 kb, which corresponds to the size distribution of skeletal muscle
poly(A)+RNA in the Total RNA used for cDNA library construction. The same 50 inserts were
sequenced using the Delta Tth DNA polymerase Sequencing kit (CLONTECH Laboratories, Inc.).
Ten of the sequences were identified in a search of the GenBank database. They are transferrin
receptor, ribosomal protein L7, myosin light chain 2, LIM domain protein, ATPase factor 6,
cytochrome C oxidase, cytoskeletal γ-actin, smooth muscle α-actin, and smooth muscle γ-actin.
For three cDNAs the sequences of the clones were longer than those published in GenBank. For
seven cDNAs, the sequences exactly corresponded to full-length mRNA sequences starting from the
cap site.

These data show that methods and compositions of the present invention can be used for
cDNA library construction to generate a high quality cDNA library with a very high level of
full-length cDNA clones.



Example 3 - CAPswitch Full-Length cDNA Library Construction Using PRE Adaptor-Primer
Strategy

CAPswitch technology can also be effectively combined with standard, conventional
(non PCR-based) technologies well known in the art. As a result, conventional procedures can
be significantly simplified, and a full-length cDNA library rather than a cDNA fragment library
will be generated. In this example, as a starting material for cDNA synthesis, we used
poly(A)+RNA. The flow chart in Figure 4 illustrates the main steps of CAPswitch full-length
cDNA library construction technology mainly based on the conventional PRE adaptor-primer
procedure described by Coleclough et al. (1985). It will be apparent to those skilled in the art
that the choice of enzymes possessing similar enzymatic activities, the structure of the CDS primer
(PRE adaptor primer), CAPswitch oligonucleotide and vector, and the choice of restriction sites
used for cloning, as shown in Figure 4, can be varied without changing the efficiency of the
subject procedure. One modification in the procedure can be to first carry out the second-strand
synthesis (step 3) followed by restriction digestion (step 2), and cloning into a vector (step 4).
Another modification can include using the adaptor ligation procedure described in Example 2
instead of restriction digestion (step 2). Use of the vector primer instead of the PRE
adaptor-primer for the first-strand cDNA synthesis (step 1) can also be employed. In this case,
the vector primer can have an oligo d(T) sequence at one end to initiate first-strand synthesis and
a CAPswitch oligonucleotide sequence at the other end to provide automatic template switching
after completion of full-length first-strand cDNA synthesis. Any such variations in the preferred
protocol which use methods and materials of the subject invention are within the scope of the
invention.
Step 1. Generation of full-length mRNA:cDNA hybrid.
10 pmol of cDNA synthesis primer (CDS3):

5'-d(TCTAGAATTCTCGAGGCGGCCGC(T)30VN)-3' (SEQ ID NO. 16)
3'-d(AGATCTTAAGAGCTCCGCCGGCG)-5' (SEQ ID NO. 17)

(where V = G or A or C; N = G or A or T or C),

and 10 pmol of CAPswitch oligonucleotide (CSO3):

5'-d(TGCTGCGAGAAGACGACAGAATTCGG)r(GGG)-3' (SEQ ID NO. 18)



35 



were annealed to 5 µg of human skeletal muscle poly(A)+ RNA (CLONTECH Laboratories,
Inc.) in a volume of 12.5 µl of deionized water, by heating the mixture for 2 minutes at 70°C,
followed by cooling on ice for 2 minutes. First-strand cDNA synthesis-template switching was then
initiated by mixing the annealed primer-RNA with 1000 units of M-MLV RNase H- reverse
transcriptase (SuperScript II reverse transcriptase, Life Technologies) in a final volume of 25 µl,
containing 50 mM Tris-HCl (pH 8.3 at 22°C); 75 mM KCl; 6 mM MgCl2; 1 mM DTT; and 1 mM
each of dATP, dGTP, dCTP, and dTTP. The first-strand cDNA synthesis-template switching
reaction was incubated at 42°C for 1.5 hours in an air incubator and stopped by addition of 75 µl
of 150 µg/ml glycogen, 10 mM EDTA. The mRNA:cDNA hybrid was extracted once with
phenol/chloroform/isoamyl alcohol (25:24:1, vol/vol), once with chloroform/isoamyl alcohol (24:1,
vol/vol), and then precipitated by addition of a half volume of 4 M ammonium acetate (about 40
µl) and 3.7 volumes of 95% ethanol (about 300 µl). After vortexing, the tube was immediately
centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 minutes. The pellet was washed with 80%
ethanol without vortexing, centrifuged as above for 10 minutes, air dried, and dissolved in 50 µl
of deionized water.
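
The precipitation volumes quoted above can be checked with a short calculation. The sketch below is illustrative only: the recovered aqueous volume of 80 µl is an assumption (the text does not state how much aqueous phase remains after the two extractions), chosen because it reproduces the "about 40 µl" and "about 300 µl" figures. The function and parameter names are hypothetical.

```python
def precipitation_volumes(aqueous_ul, salt_ratio=0.5, ethanol_ratio=3.7):
    """Return (4 M ammonium acetate, 95% ethanol) volumes in µl.

    Both ratios are taken relative to the recovered aqueous volume:
    a half volume of salt and 3.7 volumes of ethanol.
    """
    if aqueous_ul <= 0:
        raise ValueError("aqueous volume must be positive")
    return aqueous_ul * salt_ratio, aqueous_ul * ethanol_ratio


# Assumed recovered volume after extraction (not stated in the protocol).
salt_ul, ethanol_ul = precipitation_volumes(80.0)
print(f"4 M ammonium acetate: {salt_ul:.0f} µl; 95% ethanol: {ethanol_ul:.0f} µl")
```

The same function applies to the later precipitation steps if a different recovered volume is assumed.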

Step 2. Restriction digestion. The mRNA:cDNA hybrid generated at step 1 was digested for
nondirectional cloning by EcoRI restriction endonuclease (EcoRI and NotI or EcoRI and XhoI
for directional cloning) for 1 hour at 37°C in 100 µl of reaction mixture, containing 50 mM Tris-
HCl (pH 7.5), 100 mM NaCl, 10 mM MgCl2, 1 mM DTT, and 50 units of EcoRI restriction
endonuclease (New England BioLabs). The reaction was then stopped by addition of 5 µl of 2
mg/ml glycogen, 0.2 M EDTA. The mRNA:cDNA hybrid with EcoRI ends was extracted once
with phenol/chloroform/isoamyl alcohol (25:24:1, vol/vol), once with chloroform/isoamyl alcohol
(24:1, vol/vol), and then precipitated by addition of a half volume of 4 M ammonium acetate
(about 40 µl) and 3.7 volumes of 95% ethanol (about 300 µl). After vortexing, the tube was
immediately centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 minutes. The pellet was
washed with 80% ethanol without vortexing, centrifuged as above for 10 minutes, air dried, and
dissolved in 5 µl of deionized water.
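
The choice of enzymes in this step follows from the restriction sites carried by the two oligonucleotides. As a minimal sketch, the code below scans the adaptor portion of the cDNA synthesis primer (SEQ ID NO. 16, without the (T)30VN tail) and the deoxyribonucleotide portion of the CAPswitch oligonucleotide (SEQ ID NO. 18) for the EcoRI, XhoI and NotI recognition sequences. The sequences are as read from the listing; the helper name `find_sites` is hypothetical.

```python
# Recognition sequences of the three enzymes named in Step 2.
SITES = {"EcoRI": "GAATTC", "XhoI": "CTCGAG", "NotI": "GCGGCCGC"}

# Adaptor part of the cDNA synthesis primer (SEQ ID NO. 16, tail omitted).
CDS_ADAPTOR = "TCTAGAATTCTCGAGGCGGCCGC"
# Deoxyribonucleotide part of the CAPswitch oligonucleotide (SEQ ID NO. 18).
CAPSWITCH_DNA = "TGCTGCGAGAAGACGACAGAATTCGG"


def find_sites(seq, sites=SITES):
    """Map each enzyme to the 0-based start positions of its site in seq."""
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        hits[enzyme] = positions
    return hits


for label, seq in (("CDS adaptor", CDS_ADAPTOR), ("CAPswitch", CAPSWITCH_DNA)):
    print(label, find_sites(seq))
```

Under this reading, both ends of the hybrid carry an EcoRI site, while XhoI and NotI sites occur only at the primer end, which is consistent with EcoRI alone being used for nondirectional cloning and an EcoRI/NotI or EcoRI/XhoI pair for directional cloning.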

Step 3. Ligation into vector. The EcoRI-digested mRNA:cDNA hybrid was then ligated
to the λgt11 EcoRI vector arms (CLONTECH Laboratories, Inc.) overnight at 16°C under the
following conditions: 5 µl of mRNA:cDNA hybrid solution, 50 mM Tris-HCl (pH 7.8 at 22°C),
10 mM MgCl2, 1 mM DTT, 1 mM ATP, 5% polyethylene glycol (M.W. 8,000), 2.5 µg of λgt11
EcoRI arms, and 2.5 units of T4 DNA ligase (Life Technologies) in a total volume of 20 µl.

Step 4. Second-strand cDNA synthesis. Second-strand cDNA synthesis was carried out
in a total volume of 100 µl, containing 20 µl of the vector-ligated mRNA:cDNA hybrid, 20 mM
Tris-HCl (pH 7.5 at 22°C), 100 mM KCl, 10 mM (NH4)2SO4, 5 mM MgCl2, 0.15 mM β-NAD,
50 µg/ml BSA, 300 units/ml E. coli DNA polymerase I, 12 units/ml E. coli RNase H, and 60
units/ml E. coli DNA ligase. The reaction mixture was incubated at 16°C for 1.5 hours and stopped
by addition of 4 µl of 2 mg/ml glycogen, 0.2 M EDTA. The ds cDNA was extracted once with
phenol/chloroform/isoamyl alcohol (25:24:1, vol/vol), once with chloroform/isoamyl alcohol (24:1,
vol/vol), and then precipitated by addition of a half volume of 4 M ammonium acetate (about 35
µl) and 3.7 volumes of 95% ethanol (about 260 µl). After vortexing, the tube was immediately
centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 minutes. The pellet was washed with 80%
ethanol without vortexing, centrifuged as above for 10 minutes, air dried, and dissolved in 10 µl
of deionized water.
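
Because the enzymes in Step 4 are specified as concentrations (units/ml), it can help to convert them to units per reaction. The sketch below does this for the 100 µl reaction volume stated in the step; the dictionary and function names are illustrative.

```python
REACTION_VOLUME_UL = 100.0

# Enzyme concentrations from Step 4, in units per ml of reaction.
ENZYME_UNITS_PER_ML = {
    "E. coli DNA polymerase I": 300.0,
    "E. coli RNase H": 12.0,
    "E. coli DNA ligase": 60.0,
}


def units_in_reaction(units_per_ml, volume_ul):
    """Convert a units/ml concentration to total units in volume_ul."""
    return units_per_ml * volume_ul / 1000.0


units = {
    name: units_in_reaction(conc, REACTION_VOLUME_UL)
    for name, conc in ENZYME_UNITS_PER_ML.items()
}
for name, amount in units.items():
    print(f"{name}: {amount:g} units per {REACTION_VOLUME_UL:g} µl reaction")
```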

The full-length cDNA library was then packaged using the standard protocol described in the
laboratory manual by Sambrook et al. (1989). In order to confirm the quality of the library
generated by the methods and materials of the present invention, we carried out the same quality
control experiments as in Example 2 for the PCR-based technology. The size distribution and the
efficiency of cloning of full-length cDNAs were similar for both libraries.

These data show that the methods and compositions of the present invention can be used
for cDNA library construction based on the PRE adaptor-primer strategy to generate high quality
cDNA libraries with a very high level of full-length cDNA clones.

Example 4. Preferred Method for Amplification of ds cDNA using CAPswitch Technology and Using
this ds cDNA in cDNA Subtraction.

The methods and materials of the subject invention were also used for producing
high-quality cDNA from nanograms of total or poly A+ RNA. The flow chart in Figure 1 (steps
1 and 2) shows the main steps of this procedure. It will be apparent to those skilled in the art that
some details of the preferred protocol, such as the structure of the CDS primer and the CAPswitch
oligonucleotide and the time of addition of different reaction components, can be varied without
changing the efficiency of the whole procedure.

In one embodiment of the subject invention, the ds cDNA amplified by methods and
materials of the present invention has been used in combination with cDNA subtraction
technology. When total RNA is used for cDNA synthesis by conventional methods, ribosomal
RNA is transcribed along with the poly A+ RNA fraction, even if synthesis is oligo(dT)-primed. If
this cDNA is used for a cDNA subtraction procedure, the excess of ribosomal RNA, impurities of
genomic DNA, and the low concentration of cDNA corresponding to the poly A+ RNA fraction
result in inefficient subtractive hybridization. However, cDNA generated using methods and
materials of the present invention can be directly used for a subtractive hybridization procedure,
even if total RNA was used as starting material. In a preferred sub-embodiment, tester and driver
ds cDNA amplified by methods and materials of the present invention were used in combination
with Suppression Subtractive Hybridization technology (Chenchik et al., U.S. Patent No.
5,565,340). Other methods of subtractive hybridization, described, for example, by Wigler et al.
(U.S. Patent No. 5,436,142); Hampson et al. (Nucl. Acids Res. 20:2899 (1992)); Yang et al. (Anal.
Biochem. 237:109-114 (1996)); Balzer et al. (Nucl. Acids Res. 22:2653-2854 (1994)), and others, can
be employed using ss or ds cDNA corresponding to tester and/or driver and amplified according
to the methods of the present invention.

Step 1. First-strand synthesis - template switching. 10 pmol of cDNA synthesis primer (oligo d(T)
primer) CDS3: 5'-d(AAGCAGTGGTAACAACGCAGAGTAC(T)30)-3' (SEQ ID NO. 67),

and 10 pmol of CAPswitch oligonucleotide (Na1smG3):
5'-d(AAGCAGTGGTAACAACGCAGAGTACGC)r(GGG)-3' (SEQ ID NO. 68),

were annealed in two separate test tubes to 1 µg each of tester and driver total RNA
(CLONTECH Laboratories, Inc.), in a volume of 5 µl of deionized water, by heating the mixture
for 2 min at 70°C, followed by cooling on ice for 2 min. First-strand cDNA synthesis was then
initiated by mixing the annealed primer-RNA with 200 units of M-MLV RNase H- reverse
transcriptase (SuperScript II reverse transcriptase, Life Technologies) in a final volume of 10 µl,
containing 50 mM Tris-HCl (pH 8.3 at 22°C), 75 mM KCl, 6 mM MgCl2, 1 mM DTT, and 1 mM
each of dATP, dGTP, dCTP and dTTP. The first-strand cDNA synthesis-template switching reaction
was incubated at 42°C for 1.5 hr in an air incubator, then diluted by adding 40 µl of TE
buffer (10 mM Tris [pH 7.6], 1 mM EDTA) and heated at 72°C for 7 min.
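
To relate this step to the claim that cDNA can be produced from nanograms of RNA, the sketch below estimates how much starting RNA is represented in one amplification reaction. It assumes the quantities are read as 1 µg of total RNA, a 10 µl first-strand reaction, 40 µl of TE diluent, and the 5 µl aliquot used in the amplification step; the function name is hypothetical.

```python
def rna_equivalent_ng(input_rna_ng, first_strand_ul, diluent_ul, aliquot_ul):
    """RNA input (ng) represented by an aliquot of diluted first-strand cDNA."""
    total_ul = first_strand_ul + diluent_ul
    if not 0 < aliquot_ul <= total_ul:
        raise ValueError("aliquot must be positive and no larger than the total volume")
    return input_rna_ng * aliquot_ul / total_ul


# 1 µg (1000 ng) RNA, 10 µl reaction + 40 µl TE, 5 µl taken into each PCR.
per_pcr_ng = rna_equivalent_ng(1000.0, 10.0, 40.0, 5.0)
print(f"Each PCR contains cDNA derived from about {per_pcr_ng:.0f} ng of total RNA")
```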

Step 2. cDNA Amplification

PCR amplification of full-length cDNA was performed using the Advantage KlenTaq Polymerase
Mix (CLONTECH Laboratories, Inc.). Amplification was conducted in a 100-µl volume
containing 5 µl of diluted first-strand cDNA, 40 mM Tricine-KOH (pH 9.2 at 22°C), 3.5 mM
Mg(OAc)2, 10 mM KOAc, 75 µg/ml BSA, 200 µM each of dATP, dGTP, dCTP and dTTP, 0.2
µM of PCR primer (Na1sm): 5'-AAGCAGTGGTAACAACGCAGAGT-3' (SEQ ID NO. 69),

and 2 µl of KlenTaq Polymerase Mix. Temperature parameters of the PCR reactions were as
follows: 1 min at 95°C, followed by 17-19 cycles of 95°C for 15 sec and 68°C for 5 min, followed
by a 10-min final extension at 68°C. PCR products were examined on 1.2% agarose/EtBr gels in
1X TAE buffer. As a DNA size marker we used a 1 Kb DNA Ladder (Life Technologies).
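
The cycling parameters above can be written down as data and used to estimate the programmed run time. This is a sketch only: it reads the program as 95°C for 1 min, then 17-19 cycles of 95°C for 15 sec and 68°C for 5 min, then 68°C for 10 min, and it ignores ramp times, which depend on the thermal cycler. The class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    temp_c: float
    seconds: int


INITIAL = Segment(95, 60)                      # initial denaturation
CYCLE = (Segment(95, 15), Segment(68, 300))    # denature, anneal/extend
FINAL = Segment(68, 600)                       # final extension


def program_seconds(cycles):
    """Total hold time in seconds for the given number of cycles."""
    if cycles < 1:
        raise ValueError("at least one cycle is required")
    per_cycle = sum(segment.seconds for segment in CYCLE)
    return INITIAL.seconds + cycles * per_cycle + FINAL.seconds


for n in (17, 19):
    print(f"{n} cycles: {program_seconds(n) / 60:.2f} min of programmed hold time")
```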
Terminate the reaction by adding 2 µl of 0.5 M EDTA.

Step 3. cDNA purification.

To the tester and driver PCR products, each combined from two reaction tubes, add an equal
volume of phenol:chloroform:isoamyl alcohol (25:24:1). Vortex thoroughly. Centrifuge the tubes
at 14,000 rpm for 10 min to separate the phases. Remove the top (aqueous) layer and place it in
a clean 1.5-ml tube. Add 700 µl of n-butanol and vortex the mix thoroughly. Butanol extraction
allows you to concentrate the PCR product to a volume of 40-70 µl. Centrifuge the solution at room
temperature at 14,000 rpm for 1 min. Remove and discard the upper (n-butanol organic)
phase. If you do not end up with a volume of 40-70 µl, repeat the butanol extraction step with the
same volume of n-butanol. Invert a 0.75 ml CHROMA SPIN-1000 (CLONTECH
Laboratories, Inc.) column several times to completely resuspend the gel matrix. Remove the top
cap from the column, and then remove the bottom cap. Place the column into a 1.5-ml centrifuge
tube or a 17 x 100 mm tube. Discard any column buffer that immediately collects in the tube and
add 1.5 ml of 1X TNE buffer (10 mM Tris-HCl (pH 8.0), 10 mM NaCl, 0.1 mM EDTA). Let the
buffer drain through the column by gravity flow until you can see the surface of the gel beads in
the column matrix. Discard the collected buffer and proceed with purification. Carefully and slowly
apply the sample to the center of the gel bed's flat surface. Do not allow any sample to flow along
the inner wall of the column. Apply 25 µl of 1X TNE buffer and allow the buffer to completely
drain out of the column. Apply 150 µl of 1X TNE buffer and allow the buffer to completely drain
out of the column. Transfer the column to a clean 1.5-ml microcentrifuge tube. Apply 320 µl of 1X
TNE buffer and collect this as your purified cDNA fraction. To confirm that your PCR product
is present in the purified cDNA fraction, perform the agarose/EtBr gel analysis as described in
Step 2.

Step 4. cDNA Subtraction.

Perform an Rsa I digestion and cDNA subtraction as described in detail by Chenchik et
al. (U.S. Patent No. 5,565,340).

It should be understood that the examples and embodiments described herein are for
illustrative purposes only and that various modifications or changes in light thereof will be
suggested to persons skilled in the art and are to be included within the spirit and purview of this
application and the scope of the appended claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT INFORMATION:
Applicant Name(s): CLONTECH LABORATORIES, INC.
Street address: 1020 East Meadow Circle
City: Palo Alto
State/Province: California
Country: US
Postal code/ZIP: 94303
Phone number: (415) 424-8222 Fax: (415) 424-1352

(ii) TITLE OF INVENTION: METHOD FOR FULL-LENGTH cDNA CLONING
(iii) NUMBER OF SEQUENCES: 66

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Saliwanchik & Saliwanchik
(B) STREET: 2421 N.W. 41st Street, Suite A-1
(C) CITY: Gainesville
(D) STATE: Florida
(E) COUNTRY: USA
(F) ZIP: 32606

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #l*-ii)

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Saliwanchik, David R.
(B) REGISTRATION NUMBER: 31,794
(C) REFERENCE/DOCKET NUMBER: CL-7

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (904) 375-8100
(B) TELEFAX: (904) 372-5800



SEQ ID NO. 1: CzR1 5'-d(TGTAGCGTGAAGACGACAGAA)r(N)12-3'
SEQ ID NO. 2: CzR2 5'-d(TGTAGCGTGAAGACGACAGAA(N)11)r(N)1-3'
SEQ ID NO. 3: CzR3 5'-d(TGTAGCGTGAAGACGACAGAA(N)11)-3'
SEQ ID NO. 4: CzR4 5'-d(TGTAGCGTGAAGACGACAGAAGGATG(N)9)r(N)1-3'
SEQ ID NO. 5: Na21-N4 5'-d(TGTAGCGTGAAGACGACAGAA)r(N)4-3'
SEQ ID NO. 6: Na21-N8 5'-d(TGTAGCGTGAAGACGACAGAA)r(N)8-3'
SEQ ID NO. 7: Na21-N12 5'-d(TGTAGCGTGAAGACGACAGAA)r(N)12-3'

II l\ CDSl: -'-^i-<l^^^^^ oA or T or C) 

SEQ ID NO. 9: CSOl: ^'"^C™^-^^^^ 

SEQ ID NO. 10: ACT1: 5'-ACTCGTCATACTCCTGCTTGCTGATCCACATCTGC-3'

111 11 NO lU S; 5' - OTCAATGTCCCAAACGTCACCAG^^^^ 3' 
SEQ ID NO. 12: CSP1: 5'-d(CTAATACGACTCACTATAGGGC)-3'
SEQ ID NO. 13: CSO2: 5'-d(CTAATACGACTCACTATAGGGC)r(GGG)-3'
SEQ ID NO. 14: Ad1: 5'-d(AATTCGCGGCCGCGTCGAC)-3'
SEQ ID NO. 15: 3'-d(GCGCCGGCGCAGCTGp)-5'
(where p = 3'-phosphate group)
SEQ ID NO. 16: CDS2: 5'-d(TCTAGAATTCTCGAGGCGGCCGC(T)30VN)-3'
SEQ ID NO. 17: 3'-d(AGATCTTAAGAGCTCCGCCGGCG)-5'
(where V = G or A or C; N = G or A or T or C)
SEQ ID NO. 18: CSO3: 5'-d(TGCTGCGAGAAGACGACAGAATTCGG)r(GGG)-3'
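
Two properties of SEQ ID NOs. 16 and 17 can be checked mechanically: that SEQ ID NO. 17, read 3' to 5', is the base-by-base complement of the adaptor portion of SEQ ID NO. 16, and how many distinct 3' anchors the degenerate VN dinucleotide represents. The sketch below is illustrative; the sequences are as read from the listing and the helper names are hypothetical.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
# Degenerate codes as defined in the listing: V = G, A or C; N = G, A, T or C.
DEGENERATE = {"V": "GAC", "N": "GATC"}

ADAPTOR_5_TO_3 = "TCTAGAATTCTCGAGGCGGCCGC"  # SEQ ID NO. 16 without (T)30VN
LOWER_3_TO_5 = "AGATCTTAAGAGCTCCGCCGGCG"    # SEQ ID NO. 17, written 3' to 5'


def complement(seq):
    """Base-by-base complement, keeping the written orientation."""
    return "".join(COMPLEMENT[base] for base in seq)


def expand(seq):
    """List every concrete sequence encoded by a degenerate sequence."""
    pools = [DEGENERATE.get(base, base) for base in seq]
    return ["".join(combo) for combo in product(*pools)]


anchors = expand("VN")
print("SEQ 17 pairs with SEQ 16 adaptor:", complement(ADAPTOR_5_TO_3) == LOWER_3_TO_5)
print("Number of VN anchors:", len(anchors))
```

Because V excludes T, none of the anchors begins with T, which is what forces the primer to sit at the junction between the poly(A) tail and the transcript body.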



Additional CAPswitch oligonucleotides:

SEQ ID NO. 19: Na21-G 5'-d(TGTAGCGTGAAGACGACAGAA)r(G)-3'
SEQ ID NO. 20: Na21-G3 5'-d(TGTAGCGTGAAGACGACAGAA)r(GGG)-3'
SEQ ID NO. 21: Na21-N4G3 5'-d(TGTAGCGTGAAGACGACAGAA)r(N4G3)-3'
SEQ ID NO. 22: Na21-GCGGCN4G3 5'-d(TGTAGCGTGAAGACGACAGAA)r(GCGGCN4G3)-3'
SEQ ID NO. 23: Na21-GTAAG3 5'-d(TGTAGCGTGAAGACGACAGAA)r(GTAAG3)-3'
SEQ ID NO. 24: Na21-GATTG3 5'-d(TGTAGCGTGAAGACGACAGAA)r(GATTG3)-3'
SEQ ID NO. 25: Na21-TGTTG3 5'-d(TGTAGCGTGAAGACGACAGAA)r(TGTTG3)-3'
SEQ ID NO. 26: Na21-CTAAG3 5'-d(TGTAGCGTGAAGACGACAGAA)r(CTAAG3)-3'
SEQ ID NO. 27: Na21-GGTAG3 5'-d(TGTAGCGTGAAGACGACAGAA)r(GGTAG3)-3'
SEQ ID NO. 28: Na21-G2p 5'-d(TGTAGCGTGAAGACGACAGAA)r(GGp)-3'
SEQ ID NO. 29: Na21-G3p 5'-d(TGTAGCGTGAAGACGACAGAA)r(GGGp)-3'
SEQ ID NO. 30: Na21-G5p 5'-d(TGTAGCGTGAAGACGACAGAA)r(GGGGGp)-3'
SEQ ID NO. 31: Na21N-G3 5'-d(TGATGCGAGTAGACGACAGAA)r(GGG)-3'
SEQ ID NO. 32: Na21N-G3p 5'-d(TGATGCGAGTAGACGACAGAA)r(GGGp)-3'
SEQ ID NO. 33: Na21ND-G3p 5'-d(TGATGCGAGTAGACGACAGA)r(GGGp)-3'
SEQ ID NO. 34: Na21B-G3p 5'-d(TACGATGCGAGTAGACGACAGAA)r(GGGp)-3'
SEQ ID NO. 35: Na22-G3 5'-d(TGCTGCGAGAAGACGACAGAA)r(GGG)-3'
SEQ ID NO. 36: Na22-G3p 5'-d(TGCTGCGAGAAGACGACAGAA)r(GGGp)-3'
SEQ ID NO. 37: Na22M-G3 5'-d(TTGCTGGCAGAAGACGACAGA)r(GGG)-3'
SEQ ID NO. 38: T7-G 5'-d(CTAATACGACTCACTATAGGGC)r(G)-3'
SEQ ID NO. 39: T7-G2 5'-d(CTAATACGACTCACTATAGGGC)r(GG)-3'
SEQ ID NO. 40: T7-G3 5'-d(CTAATACGACTCACTATAGGGC)r(GGG)-3'
SEQ ID NO. 41: T7-G5 5'-d(CTAATACGACTCACTATAGGGC)r(GGGGG)-3'
SEQ ID NO. 42: T7-Gp 5'-d(CTAATACGACTCACTATAGGGC)r(Gp)-3'
SEQ ID NO. 43: T7-G2p 5'-d(CTAATACGACTCACTATAGGGC)r(GGp)-3'
SEQ ID NO. 44: T7-G3p 5'-d(CTAATACGACTCACTATAGGGC)r(GGGp)-3'
SEQ ID NO. 45: T7-G5p 5'-d(CTAATACGACTCACTATAGGGC)r(GGGGGp)-3'
SEQ ID NO. 46: T7-GCG 5'-d(CTAATACGACTCACTATAGGGC)r(GCG)-3'
SEQ ID NO. 47: T7-GCG2 5'-d(CTAATACGACTCACTATAGGGC)r(GCGG)-3'
SEQ ID NO. 48: T7-CG 5'-d(CTAATACGACTCACTATAGGGC)r(CG)-3'
SEQ ID NO. 49: T7-DG 5'-d(CTAATACGACTCACTATA)r(GGGCG)-3'
SEQ ID NO. 50: T7-N9G3 5'-d(CTAATACGACTCACTATAGGGC)r(N9GGG)-3'
SEQ ID NO. 51: T7-GCG3 5'-d(CTAATACGACTCACTATAGGGC)r(GCGGG)-3'
SEQ ID NO. 52: T7-SUP1 5'-d(CTAATACGACTCACTATAGGGCGCGGCCGCCCGGG)r(GCG3)-3'



SEQIDNO* 53; T7-SUP2 5 '-d(CTAATACGACTCACTATAGGGCACGCGTGCXCGACGGCCCGG) r <GCG3)-3 



SEQ ID NO. 54: T7-G3-NH2 5'-d(CTAATACGACTCACTATAGGGC)r(GGG-NH2)-3'
SEQ ID NO. 55: T7-G3-BIO 5'-d(CTAATACGACTCACTATAGGGC)r(GGG-BIO)-3'
SEQ ID NO. 56: T7-G3-GLY 5'-d(CTAATACGACTCACTATAGGGC)r(GGG-GLY)-3'
(where NH2, BIO and GLY are respectively amino, biotin and glycerol groups at the 3' position
of the ribose residue)
SEQ ID NO. 57: T7-GAG3p 5'-d(CTAATACGACTCACTATAGGGC)r(GAGGGp)-3'
SEQ ID NO. 58: T7-GTG3p 5'-d(CTAATACGACTCACTATAGGGC)r(GTGGGp)-3'
SEQ ID NO. 59: T7-GGAG2p 5'-d(CTAATACGACTCACTATAGGGC)r(GGAGGp)-3'
SEQ ID NO. 60: T7-GGTG2p 5'-d(CTAATACGACTCACTATAGGGC)r(GGTGGp)-3'
SEQ ID NO. 61: T7-GACG2p 5'-d(CTAATACGACTCACTATAGGGC)r(GACGGp)-3'
SEQ ID NO. 62: T7-GATG2p 5'-d(CTAATACGACTCACTATAGGGC)r(GATGGp)-3'
SEQ ID NO. 63: T7-GTTG2p 5'-d(CTAATACGACTCACTATAGGGC)r(GTTGGp)-3'
SEQ ID NO. 64: T7-GAGTGp 5'-d(CTAATACGACTCACTATAGGGC)r(GAGTGp)-3'
SEQ ID NO. 65: T7K-GGAG3p 5'-d(TCCTAATACGACTCACTATA)r(GGAGGGp)-3'
SEQ ID NO. 66: T7-GAG3p 5'-d(CTAATACGACTCACTATAGGGC)r(GAGGGp)-3'



SEQ ID NO. 67: CDS3 5'-d(AAGCAGTGGTAACAACGCAGAGTAC(T)30)-3'
SEQ ID NO. 68: Na1smG3 5'-d(AAGCAGTGGTAACAACGCAGAGTACGC)r(GGG)-3'
SEQ ID NO. 69: Na1sm 5'-d(AAGCAGTGGTAACAACGCAGAGT)-3'
SEQ ID NO. 70: Na21-Bio 5'-d(Bio-TGTAGCGTGAAGACGACAGAA)-3'
SEQ ID NO. 71: T7N 5'-d(CTAATACGACTCACTATAGGGC)-3'
SEQ ID NO. 72: EXT7 5'-d(TGCCATCCTAATACGACTCACTA)-3'
SEQ ID NO. 73: T7MPolc 5'-d(CTAATACGACTCACGGATGGGC)-3'
SEQ ID NO. 74: Na21N 5'-d(TGATGCGAGTAGACGACAGAA)-3'
SEQ ID NO. 75: Na22 5'-d(TGCTGCGAGAAGACGACAGAA)-3'
SEQ ID NO. 76: Na22M 5'-d(TTGCTGGCAGAAGACGACAGA)-3'
SEQ ID NO. 77: Na21B 5'-d(TACGATGCGAGTAGACGACAGAA)-3'

SEO ID N0.78J Frl 5'- d(TGACCAGT0A0t»GAGT6ACGA) - 3' 

110 ID NO 79 T3 5'- d(Ct»TCCRATTAACCCTCACTAAAG6GC) - 3- 
lloiDNO aO NalM 5'- d(AAGCa«SA0OCRACaACGCRGA) - 3- 

111 ll N0.81: T7M2 5'- d(ACAA0ACGAAGCAC5AAGAGGGC) - 3' 
lis 82 HalH 5'- d(AAGCAOAGGCRACJ^CGCRGA) - 3' 
%vn TD NO 83 » T7T S'- dlTTCCGCTTGTCTGCTGGGC - 3' 

SEQ ID N0.871 RT/^oic ^ ^.cTAATACGACTCACTATAGGQC)d<III ) - 3' 

r(CTAATACGACTCACTATAOGGC)d (UUU)- 3' ("^1 



S£Q TO M0*66t 

I 111 S» |:i<^^S»Sfi^! : t: 

5^?d(MACGACTa^CTATACM3GCTCGAGCTC - 3' 

l^?d^LSTOASTkc?ATASM^ 

SEQ ID KO.107$ 3BP-NSMT 7* 
5'-<a<TTCOGCrrTGTCn«CT0GGCTCGTGCGGCC«^ - ^ 

1^ M MO lioi T7-T30NN S ' -d CCATOCTAATACOACTCaiCTATAGGGC ( T ) -3 
iio " Bio:i7-T30HH ^.d(Blo-AATACGACTCACTRTAGGOC(!t),oVN) -3' 

SEQ ID NO. 112s 5'-d(TGCTOCaOAAQACaACAGAA)r(GGG)-3' 

II S§: nil i::^!S^^Sr^-3' .whe.e V.A,G, o. C- and wh«e 

N*A,G^Cr or T) • 
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Claims 

We claim: 

1. A method for preparing DNA that is complementary to the 5'-end of an RNA
molecule, said method comprising annealing a cDNA synthesis primer to said RNA molecule and
synthesizing a first DNA strand that is complementary to at least a portion of said RNA molecule,
and contacting said RNA molecule with a template switching oligonucleotide having a pre-selected
arbitrary nucleotide sequence at its 5'-end and at least one riboguanine residue at its 3'-end,
wherein said template switching oligonucleotide attaches to the 5'-end of said RNA and serves
as a template for extension of the 3'-end of said first DNA strand.

2. The method, according to claim 1, further comprising amplifying said first DNA
strand using a second oligonucleotide primer that comprises a nucleotide sequence that is
complementary to at least a portion of said pre-selected nucleotide sequence of said template
switching oligonucleotide.

3. The method, according to claim 1, wherein said RNA molecule has a 7-methylguanosine
CAP structure attached at the 5'-end of said RNA.

4. The method, according to claim 1, wherein said template switching
oligonucleotide has at least three riboguanine residues at its 3'-end.

5. The method, according to claim 1, wherein the 3'-OH group of ribose of said
riboguanine residue comprises a chemical group selected from the group consisting of amino,
biotin, phosphate, glycerol, and fluoro group.

6. The method, according to claim 1, wherein said template switching
oligonucleotide comprises a protein that binds to the 5' CAP structure of said RNA.

7. The method, according to claim 6, wherein said protein is selected from the group
consisting of eukaryotic initiation factor 4E and anti-CAP structure antibody.

8. The method, according to claim 1, wherein said template switching
oligonucleotide is represented by the formula:
5'-dN1-dN2-...dNm-rN1-rN2...rNn-3',
wherein dN represents a deoxyribonucleotide selected from the group consisting of dAMP, dCMP,
dGMP and dTMP;
m represents an integer 0 and above;
rN represents a ribonucleotide selected from among AMP, CMP, GMP and UMP; and n
represents an integer 1 and above.
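
The formula in claim 8, together with the riboguanine requirement of claims 1 and 4, can be illustrated with a small parser for the `d(...)r(...)` notation used in the sequence listing. This is a sketch under stated assumptions: it handles only plain base letters (not degenerate positions such as N, repeat shorthand such as G3, or 3' modifications such as p), and the function names are hypothetical.

```python
import re

# d(...) may be empty (m >= 0); r(...) must hold at least one base (n >= 1).
OLIGO_RE = re.compile(r"d\(([ACGT]*)\)r\(([ACGU]+)\)")


def parse_oligo(notation):
    """Split 'd(SEQ)r(SEQ)' into its (deoxy, ribo) parts."""
    match = OLIGO_RE.fullmatch(notation)
    if match is None:
        raise ValueError(f"not of the form d(...)r(...): {notation!r}")
    return match.group(1), match.group(2)


def ends_in_riboguanine(notation, minimum=1):
    """True if the 3' end carries at least `minimum` consecutive riboguanines."""
    _, ribo = parse_oligo(notation)
    trailing_g = len(ribo) - len(ribo.rstrip("G"))
    return trailing_g >= minimum


example = "d(TGTAGCGTGAAGACGACAGAA)r(GGG)"  # deoxy and ribo parts of SEQ ID NO. 20
print(parse_oligo(example))
print(ends_in_riboguanine(example, minimum=3))
```

With `minimum=1` the check corresponds to the riboguanine condition of claim 1, and with `minimum=3` to that of claim 4.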

9. The method, according to claim 1, wherein said template switching
oligonucleotide is selected from group consisting of SEQ ID
NO.l,2,M.5,6.7.9.13.18.19.20,21,22,23.24.25,26,27.28,2930.31,3233.343536,3738,39,40.41,4^^^^ 
44.45.46.47,48.49^0,51,52,53,54,55,56,57,58,59.60.61.62,<3,M,65,66.68,84,85,86.87.88M^^ 

93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107 and 108.

10. The method, according to claim 1, wherein said cDNA synthesis primer comprises
a sequence complementary to at least one target mRNA.

11. The method, according to claim 1, wherein said cDNA synthesis primer comprises a
sequence complementary to the poly(A) portion of polyA+ RNA.

12. The method, according to claim 1, wherein said cDNA synthesis primer is selected
from group consisting of SEQ. ID. NO. 8, 16, 67, 109, 110, and 111.

13. The method, according to claim 1, wherein said full-length anchored cDNA contain
at least one hapten group and further comprising the steps of incubating said haptenylated
anchored cDNA with the binding ligand of the hapten of said haptenylated anchored cDNA, said
binding ligand being conjugated to support; said incubation being under condition sufficient to
permit said haptenylated anchored cDNA probe to become bound to said binding ligand of said
support; then recovering said haptenylated anchored cDNA bound to support from any unbound
molecules and incubating said haptenylated anchored cDNA probe bound to support under
conditions sufficient to release said haptenylated anchored cDNA.

14. The method, according to claim 1, further comprising the step of treating said
full-length anchored cDNA, generated at step (b) under conditions in which a second cDNA
strand is synthesized by DNA polymerase, using the said full-length anchored cDNA strand as a
template, thereby generating full-length ds cDNA.

15. The method, according to claim 1, further comprising the steps of incubating said
full-length anchored cDNA, generated at step (b) with a combination of primers comprising a
nucleotide sequence that is corresponded to at least portion of the said template switching
oligonucleotide and of the said cDNA synthesis primer, respectively and an effective amount of
other reagents necessary to perform a PCR; the incubation being under conditions sufficient to
perform PCR to thereby generate a full-length ds cDNA.

16. The method, according to claim 15, wherein said primers are selected from the
group consisting of SEQ. ID. NO. 10, 11, 12, 17, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, and 83.

17. The method, according to claim 15, further comprising the steps of labelling said
full-length ds cDNA by reporter groups; contacting said labelled full-length ds cDNA with the
defined oligonucleotide/polynucleotides corresponding particular set of the genes immobilized on
the solid support under conditions enough to permit the formation of patterns of hybridization
on the surfaces caused by hybridization between certain polynucleotide sequences in the said
labelled full-length ds cDNA with the certain immobilized defined oligonucleotide/polynucleotides;
reveal said these patterns by detection of said reporter groups.

18. A template switching oligonucleotide comprising a pre-selected nucleotide
sequence at its 5'-end and at least one riboguanine residue at its 3'-end.

19. The template switching oligonucleotide, according to claim 18, wherein said
oligonucleotide comprises at least three riboguanine residues at its 3'-end.

20. A template switching oligonucleotide, according to claim 18, represented by the
formula:
5'-dN1-dN2-...dNm-rN1-rN2...rNn-3',
wherein dN represents a deoxyribonucleotide selected from the group consisting of dAMP, dCMP,
dGMP and dTMP;
m represents an integer 0 and above;
rN represents a ribonucleotide selected from among AMP, CMP, GMP and UMP; and n
represents an integer 1 and above.

21. The template switching oligonucleotide, according to claim 18, wherein said
template switching oligonucleotide is selected from group consisting of SEQ ID

3 NO.l,23.4,5.6,7,9,13,18,19,20,21,22;j3,24,25,26,27,28,29,30313233.343^ 

4 44,45,46^47,48,49,50,51,5243,54,55,56,57,58,59,60,61,62,63,64,65.66,68,84,8^^ 

93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107 and 108.

22. The template switching oligonucleotide, according to claim 18, wherein the 3'-OH
group of ribose of said riboguanine residue comprises a chemical group selected from the group
consisting of amino, biotin, phosphate, glycerol, and fluoro group.

23. The template switching oligonucleotide, according to claim 18, wherein said
template switching oligonucleotide comprises a protein that binds to the 5' CAP structure of said
RNA.

24. The template switching oligonucleotide, according to claim 18, wherein said
protein is selected from the group consisting of eukaryotic initiation factor 4E and anti-CAP
structure antibody.

25. A kit comprising in one or more compartments a template switching
oligonucleotide according to claim 18.
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